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M335 
Introduction 


‘Mathematics is the queen of the sciences and number theory the queen of 
mathematics.’ Gauss 


The Overall Course 


This course is one of the options for the Open University half credit course 
M335: Studies in Pure Mathematics. For information about the overall course, 
including instructions concerning the continuous assessment and final 
examination, see Guide to the Course M335. 


The Set Textbook 


The set textbook for the course is the 1980 Open University Edition of 
Elementary Number Theory by David M. Burton, published by Allyn and 
Bacon. It is essential that you possess a copy of this edition. 


It is inconceivable that for a course of this length we could complete the whole 
of Burton’s book, and consequently some parts have had to be missed out. To 
get the right foundations we must start at the very beginning of the book, but 
we also felt that we should like to reach the later chapters, which cover some of 
the more famous and interesting topics in the subject. To achieve this we have 
pursued a careful path through the text, omitting several sections from the 
middle parts of the book. 


Of course, there is no reason why you should not read for yourself the sections 
that we have chosen to omit. We emphasize that nothing has been left out 
because it is in any way unsuitable (or uninteresting!), but merely because of the 
need to present an acceptable total amount of material for assessment purposes. 


The Course Units 


The overall material has been divided into eight more or less equal parts which 
we present as ‘units’. Each unit, including the associated assessment materials, is 
intended to involve you in about 10-12 hours work. 


Our unit texts consist of guided tours through the relevant parts of the set 
textbook, which we refer to as B (for Burton). They comprise commentary and 


notes on the text of B, supplementary text, worked examples (the ends of which 
are marked [7], if necessary) and self-assessment questions (SAQs) with solutions 
(the ends of which are marked B). Additional exercises for each unit appear in 
the Exercise Booklet, bound separately from the main unit text. 


Each unit is subdivided into sections which, as a rule, correspond to sections of 
B. There are however several instances where we have broken long sections of B 
into parts. Most units have four or five sections corresponding very roughly to 
an ‘evening’s work’ each. Unit 1 is an exception, having eight shorter sections. 
Examples and SAQs are numbered in order, starting at 1 in each unit. 


The referencing in our texts will always clarify whether we are referring to B or 
to one of our units. The reading passages are clearly indicated; for instance 


READ B from page 54, line 11 to the end of Section 3.2. 


Most reading passages are accompanied by some notes in the unit text, the 
notes being referenced by a line in B. (Line —n denotes the nth line up from the 
bottom of the page: the end of the note is marked A.) 
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The Role of Problems 


Whilst number theory is arguably the purest of all subjects within mathematics, 
you must not form the impression that it is all about development of theories 
and proofs of theorems. On the contrary, number theory is about problem 
solving. Nobody masters the calculus by just reading about it; it is essential to 
practise differentiation and integration on a large number of examples. The 
same is true of number theory. You cannot get to grips with number theory 
without solving problems yourself. 


To this end we have included a good stock of self-assessment questions in all 
units and you should always have pencil and paper at hand. The Examples in 
the units are for you to read, but the SAQs are for you to do. Do not spend 
forever on a problem which looks like defeating you. If you get stuck you 
should refer to the solution that is there—but do not give in too easily! 


Further problems (and their solutions) on each unit can be found in the 
Exercise Booklet. It is not intended that you should work through all of these. 
If you feel confident that you have mastered a certain section then there will be 
no need to attempt further problems on that section. On the other hand, where 
a section has proved to be heavy going the additional problems will be most 
valuable. 


The Exercise Booklet also contains a number of challenge problems. These are 
intended only for the ‘high fliers’ who enjoy a hard challenge, being pitched at a 
level above the expected attainment from the course. 


Course Structure 


The course has a linear structure. By this we mean that the units should be 
studied in order, for each is in some way dependent on its predecessors. The 
exception to this rule is Unit 7 (on which the first section of Unit 8 depends). 
This stands alone and can be studied at any time after Unit 3. 


Tape Sections 


Four of the units contain tape sections. For these you will need access to a 
cassette player. In working these sections you will, as always, need pencil and 
paper handy, and a pause or stop button within easy reach. 


Calculator 


The nature of number theory has changed dramatically over the past few years, 
due to the advent of the computer. Computations which were well nigh 
impossible only a few years ago can now be managed with ease, with the result 
that substantial numerical evidence supporting (or toppling) outstanding 
conjectures is being gathered all the time. Although the material in this course 
presents many challenges for those with access to a computer, we have written 
the course to avoid the computational side of the subject, thereby making it 
accessible to all. There are, however, many problems in the course which will be 
simplified if you possess a calculator and, although this is not essential, it is 
advisable that you have one. The calculator recommended for your Foundation 
Course will be more than adequate. 


IWLIII > 
Course Team 
The course has been written by Alan Best and Bob Coates, with contributions 


from David Asche, Andrew Brown, Francis Coghlan, Stanley Collings, David 
Crowe (Editor) and Hossein Zand. 


Dedication 


To the memory of Don Mansfield, 


UNIT I 
BASIC IDEAS 


1.0 


1.1 
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Introduction 


This first unit comprises Chapters 1 and 2 of B (the set book by Burton). The 
objective of the unit is to lay a firm foundation for the whole course by 
examining in detail the fundamental ideas which form the basis from which to 
study properties of numbers. 


Chapter 1 reviews two topics: proof by mathematical induction and the Binomial 
Theorem. You will probably already be familiar with both these ideas (although 
you might find the approach to mathematical induction in B somewhat different 
from, for example, that in our Foundation Course). The chapter ends with a 
section on the early history of number theory. Throughout B we shall encounter 
these short, interesting historical extracts which, although not forming an 
assessable part of the course, we encourage you to read. 


From an early age we all learn how to divide one integer by another obtaining 
à quotient and a remainder. The result we use here is known as the Division 
Algorithm and is expounded at the beginning of Chapter 2. As you will quickly 
appreciate, the consequences of this simple idea, both practical and theoretical, 
make it a cornerstone of number theory. The later sections of the chapter 
develop these consequences and, in particular, introduce the greatest common 
divisor—a topic which again ought to be familiar and one which is going to 
figure prominently throughout the course. 


If you have a prior knowledge of prime numbers, there might well be a 
temptation to use them in connection with divisibility problems. But notice that 
prime numbers are not introduced in B until Chapter 3, and we do not involve 
them in this unit. The development of the material in B is logically sound. For 
instance, Euclid's Lemma (Theorem 2-5, B page 28) must seem entirely obvious 
and capable of ‘proof’ in a couple of lines to anyone who knows anything at all 
about prime numbers. But in the next chapter, Euclid's Lemma is used 
extensively to establish properties of prime numbers; so to use properties of 
primes to prove Euclid's Lemma would plainly lead to circular arguments. 


In this course we shall develop a lot of theory and investigate many of the 
classical results in the subject. However, the course will contain a heavy 
emphasis on ‘problem solving’, and you will find a plentiful supply of worked 
examples and SAQs (with solutions) in all units. This unit ends with a section 
on problem solving. The text presents a number of problems which are related 
to ideas in the unit but which are slightly more difficult then previous SAQs. 
Guidance on how to go about solving them is on the accompanying tape 
cassette. The solutions themselves appear in the Exercise Booklet. 


Mathematical Induction 


Every teacher and every textbook writer must assume that his students or 
readers come with a minimal amount of prior knowledge of the subject. B 
assumes that you are familiar with certain elementary facts about the integers— 
positive, negative and zero. 


Firstly he assumes that you know the usual rules for addition and 
multiplication, including the cancellation law, i.e. 


if ab — ac where a, b and c are integers and a Æ 0, then b = c. 


Secondly he assumes that you know the rules for inequalities. For example, if 
a>bandc<0 (ie. cis negative) then ac < bc. 
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Lastly he assumes that you have seen the *Well-ordering Principle’ which states 
that if S is a non-empty set of non-negative integers then S has a least element. 
In fact this principle may be new to you but, as you will see, it turns out to be 
very powerful and enables us to deduce as a theorem the Principle of 
Mathematical Induction in all its various forms. 


READ B Section 1.1, page 2, as far as page 6, line 15. 


NOTES (i) Page 2, last line 
In specifying a set, in this case S, B uses the symbol ‘} to stand for ‘such 
that’. From other Open University courses you are probably more familiar 
with the use of *? for this purpose. 


(ii) Page 3, line 14 
This statement of the Principle of Finite Induction might strike you as 
unfamiliar because of the presence of the set S. If so. B's ‘remark’ on page 5 
should explain how the statement relates to the usual way we write out 
proofs using Mathematical Induction. We merely show that 


() the result is true for the integer 1 (the basis for the induction), and 
that 


(i) if the result is true for any positive integer k then it is true for k + 1 
(the induction step). 


The unmentioned set S is the set of all positive integers for which the result 
is true. So proving that S is the set of all positive integers is the same as 
showing that the result is true for all positive integers. A 


SAQ 1 B page 8, Problems 1.1, number 1, parts (a) and (e). 
SAQ2 B page 9, Problems 1.1, number 2. 


SOLUTION (Like B we shall suppress mention of the set S.) 


TO SAQ I (a) The result is true when n = 1 since 


p 
‘ie 2 (basis for induction). 


Assume true for n = k, i.e. 


_k(k +1) 


142434 +k 5 


(induction hypothesis). 


Then adding k + 1 to both sides gives 
k(k +1) 


142434:4kt+(k+l= 2 


+(k +1) 
_ (kK + 1k +2) 
E 2 


showing the result to be true when n = k + 1. Therefore by the First 
Principle of Finite Induction the result is true-for all integers n > 1. 


(induction step) 


(e) The result is true for n = 1 since 
1:212 x i " 
[P e (| (basis for induction). 
Assume true for n = k. i.e. 
k(k + 1) 
2 


PB4234--4+h = | | (induction hypothesis). 
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Then adding (k + 1)? to both sides gives 


P +2 +h (k +13 


(“Ss " +(k+ 1? 


(k + 1? (K? + 4k + 4) 
4 


- (e) i (induction step) 


showing the result to be true for n = k + 1. 


Therefore, by the First Principle of Finite Induction, the result is true for all 
integers n> 1. E 


SOLUTION The result is true for n = 1 since 


TO SAQ 2 a(r^' —1) ar—1yr +1) mu E 
EMO US Pe a-car (basis for induction). 
Assume that 
etd à f > 
a+ar+ar? + + ar = ATD (induction hypothesis). 
Adding ar**! to both sides gives 
kti 1 
a tar t ar? +--+ + ark ar! -1 1 : tart! 
— a(r** — 1) + ar** (p — 1) 
r—1 
eo N 
= ap ed] (induction step) 


r—i 
showing the result is true for n =k + 1. 


Therefore, by the First Principle of Finite Induction, the result is true for all 
integers n > 1. 


COMMENT There is in fact an easier way of obtaining this result, without 
using mathematical induction. If we let 


S =a +ar +ar? +- + ar" 
then 
rS=  ar+ar +- tar" + art! 
and subtraction gives 
S—rS =a-—ar"*! 


which produces the required result. W 


READ B page 6, line 16 to line — 4. 


The following example will illustrate a situation where the Second Principle of 
Finite Induction becomes a necessity because of the nature of the problem. 


EXAMPLE 1 Suppose we are given a sequence {a;} of non-negative integers defined by the 
condition 


EIL 
4=1 and a,— Y a, forn>1. 
i=0 


(This complicated looking definition of the terms merely says that each is the 
sum of all the previous ones.) We shall prove, using the Second Principle of 


10 
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Mathematical Induction, that 
a, =2"~', for all integers n > 1. 


Firstly the result is true when n = 1 since a, = 1 and 2! ^! = 1. Now suppose 
the result is true for all the positive integers less than or equal to k, ie. a; = 2^! 
for all integers i such that 1 <i € k. (This is the induction hypothesis and, as 
usual, we are suppressing all reference to the set S of positive integers for which 
the result is true.) 


By definition, 
k k 
Orr = Y, dj do + D, aj. 
i=0 i=1 


Now we use the induction hypothesis, and this is where you should see that it is 
the Second Principle that we really need. 


k 
Oe. =1t+ 27 
i=1 


k 


2F Y 
=1+544 


(by SAQ 2 with a = 1,r =2 and n =k — 1) 


=O 


So the result is true when n = k + 1. This completes the induction step and so, 
by the Second Principle of Finite Induction, a, = 2"~' for all integers n > 1. 


SAQ 3 A sequence {a;}, i > 1 is defined by 
n-1 
a,=3 and a= | P» a) +3, for all integers n > 2. 
i=1 
Show that a, = 3-2"~' for all integers n > 1. 


a, —3 —3:2!^! so the result is true when n = 1. 


SOLUTION Assume the result is true for all the positive integers less than or equal to k, i.e. 
TO SAQ 3 a; — 3:27! for all integers i such that 1 < i < k. Then, by definition, 


k 
Ak+1 -(5 aj) +3 
i=1 


k 
(X is] +3 
i=1 


2k — 
-342l 43 (by SAQ 2 with a = 3, r =2 and n =k — 1) 
= 300% 


So the result is true when n = k + 1, and therefore, by the Second Principle of 
Finite Induction, the result is true for all integers n > 1. Wi 


READ B page 6, line —3 to the end of Section 1.1. 


NOTES (i) Page 7, line 6 
In the generalized form referred to, the statements of the First Principle 
become 


(a) no belongs to S, and 


(b) whenever the integer k > no is in S, then the next integer k + 1 must 
also be in S. 


The conclusion is as given. 


M 


11 


‘EXAMPLE 2 
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(ii) You may have noticed that B sometimes takes the induction step from k to 
k + 1, as in the original statement of the Principle, and sometimes from 
k — 1 to k. In the next section he will go from m to m + 1. All are equally 
valid since all that matters is that the induction step takes us from one 
general integer to its successor. A 


As a final example in this section we shall prove that 2" > n’? for all integers 
n > 10. The induction step here is not as direct as in previous examples. 


The result is true when n = 10 since 
21? = 1024 > 1000 = 10°, 


We next assume that 2" > k? where k > 10 is some integer and our objective is 
to prove that 2**! > (k + 1. Now 


EFL = 2-2 s 2&3 
so if we can show that 2k? > (k + 1)? we are finished. But 
(k +1) =k? 43k? 3k 1 

and (i) 1 « k?/3, since k > 10; 

(ii) as 3 <k, 9 < k? and so, multiplying by : 3k < k3/3; 

(iii) as 10 € k, 3 < k/3 and so 3k? <k3/3, ` 
Combining these facts, 

(k +1)? < k? + k?/3 + 3/3 + k?/3 = 2k? 


as required. 


The induction step in Example 2 could have been tackled in many other ways. 
Here is an alternative approach which illustrates the point that although the 
result to be proved is true for all integers k > 10, the induction step holds for all 
integers k > 4. 


To show 2k? > (k +1)? for k > 4. 
Rearranging the inequality we are required to show that 


(k +1) 


3 
B <2 08 < 2, for k > 4, 


1 
1+; 


, 1 
Now as k increases, n decreases, and so the greatest value of 1 4- 1 /k is given by 
the least value of k, namely k — 4. So 


1)? Ls 
E Es EE = 1.953 < 2, fork > 4 


as required. 


We have the curious situation where the result 2” > n? is true for n > 10, and 
also true for n = 1, (2! > 13), but false for n = 2, 3,...,9. (Try them and see!) 
Yet we have proved that the induction step works for all integers from n = 4 
onwards. But notice that from these facts we cannot deduce that 2" > n? for all 
n= 1 (since the induction step fails for n = 1), nor can we deduce that 2^ > n? 
for all n = 4 (since, despite the fact that the induction step holds, the result is 
false for n = 4). Drawing on B's anology with the dominoes, the situation is 
depicted by 


false true 

te — H—Á——"' 

-a4H] HB AIT ee GMB,. 
res induction stepholds |. — 1 1 e.o 


12 


SAQ 4 


SOLUTION 
TO SAQ 4 


NOTES 
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If any domino after the third is knocked over, all subsequent ones fall. The first 
two knocked over are numbers 1 and 10. 


B page 9, Problems 1.1, number 7. 


The result is true when n = 1 since 1(1!) = 2! — 1. 
Assume the result is true when n = k, i.e. 
1(1!) +2(2!) ++- +k(k!)= (k +1)! — 1. 
Adding (k + 1) ((k + 1)!) to both sides gives 
1(1!) +22!) +--+ (k+1)((k+1))=(k+1)!—1+(k+1)((k+1)) 
=(k+2)(k+1)!-1 
=(k+2)!-—1 


showing the result to be true when n = k + 1. By the First Principle of Finite 
Induction, the result is true for all integers n> 1. B 


1.2 The Binomial Theorem 
This section discusses the Binomial Theorem, which you met in the Foundation 
Course. There we used the notation "C, for the binomial coefficients; B uses the 


alternative notation 


READ B Section 1.2, pages 9-12. 


(i Page 10, line 9 
One way of remembering Pascal’s Rule is to think of the binomial 


coefficient à as being the number of ways of choosing k objects from n. 


There are l H l ways of choosing k objects from n + 1 objects. Either we 
choose the first one or we do not. If we do not then we are left to choose 
the k from the remaining n, which we can do in 3 ways. On the other 
hand, if we do choose the first object we need only choose k — 1 from the 
remaining n, giving the coefficient k i "n 
(ii) Page 12 lines 6—11 

If you have difficulty sorting out B's proof of the Binomial Theorem, keep 


your eye on the term involving b* in the expansion of (a + b)"* !. If k is 
neither 0 nor m + 1 then, by the equation in line 4, this term is the sum of 


(a) a times the term in b* in the expansion of (a + b)" 
and 
(b) b times the term in b*^! in the expansion of (a + b)". 


That is, using the induction hypothesis. 


LUE m n-—kriyk-1 | [P m m-k+1 pk 
JAE ET bt -o kele b 


m+1 
ES qn*i-kpk 
k > 


by Pascaľs Rule. 
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Now k = 0 and k =m + 1 have to be handled separately. If k — 0 then 

b(a + b)” contributes nothing to the term involving b° while if k = m + 1 then 
a(a + b)" contributes nothing to the term involving b”**, and this is why B 
isolates the terms a"*! and b"*!. This is the essence of B's argument, but he 
uses summation notation and handles all values of k at the same time. A 


SAQ S B page 12, Problems 1.2, number 1. 
SAQ 6 B page 12, Problems 1.2, number 4. 


SOLUTION (a) We first observe that k > 0 for the binomial coefficients to be defined. Now 
TO SAQ 5 | 


n n n! n! 
k 5 kai któn - B! ^ x D -k- T) 


(k+1)! (n— E) 
kV ^ Rn-k-1) 


=k+i<n-k 


<>k < }(n — 1), as required. 


(b) Replacing < by = in (a) we see that 


i= Lente 


and as k is an integer this implies that n — 1 is even ie. nis odd. W 


3 
SOLUTION (a) The result is true when n = 2 since ; -1- 5). 
TO SAQ 6 
2 3 k k+1 
Assume that B te H Tec H = | 3 | 


Then, adding i i ') to both sides gives 
2 3 k k+1 k+1 
Be )- ge 


k+2 
= | 3 l by Pascal’s Rule. 
By the First Principle of Finite Induction, the result is true for all integers 
eZ. 


k+1 
2 


(b) Adding together the equations 2 4 +m= m, for m= 2, 3,..., n, we 
obtain 


52 (3) + YLm= ¥ r. 
m=2 m=2 m=2 


2 n(n + 1) e 
a E 


Therefore 2 3 1= Y, n? —1 (as in the result of SAQ 1(a)). 
zi 


That is, 


e 2. 2(n4l)n(n—1l) n(nt 1) 
Ex CRI 2 


2(n  1)n(n — 1) + 3n(n + 1) 
6 


- ini +1) Qn 1) 


14 
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SAQ 7 


SOLUTION 
TO SAQ 7 
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COMMENT You might like to think of the result of SAQ 6(a) in the 


; T. : : 
following way. | is the number of ways of choosing 3 objects from n + 1 


3 
objects. Now either you choose the first object or you do not. If you do choose 
the first object then you choose the 2 further objects from the remaining n 

n 
2 
choose the first object then consider whether or not you choose the second 

object. If you do choose the second object then you choose the 2 further objects 


objects, giving the binomial coefficient | On the other hand, if you do not 


- 2x A , ; -1 
from the n — 1 remaining, giving the binomial coefficient à 2 } and so on. W 


Early Number Theory 
Throughout the book, B includes numerous short historical sections such as the 


next reading passage. The historical perspective given by these sections should 
give you a greater appreciation of the subject itself. 


READ B Section 1.3, pages 13-16. 


B page 16, Problems 1.3, number 1. 


(a) This is precisely the solution of SAQ 1(a), i.e. 


L2 34 enc ED. 
1 , 
(b) Let n be a triangular number, so that n — zi > ) for some integer 


r 2 ]. Then 
8n -- 1—4r(r - 1) c 15 Qr +1}, 

a perfect square. 
On the other hand, if 8n + 1 is a perfect square we can write 8n + 1 = k?, 
for some positive integer k. But 8n + 1 is odd and so k must be odd (the 
square of any even integer being even). So k = 2r + 1 for some integer r = 1 
r(r + 1) 

2 


and 8n + 1 = (2r + 1 = 4r(r + 1) + 1, giving n= ,a 


triangular number. 


r(r + Mx 


(c) Consecutive triangular numbers are d whose sum is 


(r - 1)(r4 2) 
2 


(r + 1), a perfect square. 


r(r + 1) 


(d) Let n = , for some integer r > 1. Then 


1 9r? 49r +2 
9n 1297 yy MT 


_ Qr 1) Gr +2) 
CIE i 
LK 1) 


2 where k = 3r 4-: 1. 


15 
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Similarly, 
25n 4.3 = 8r t 2 6r + 3) 
2 
and 
49n + 6 = Ur t 3)Ur +4) 


2 
both of which are triangular numbers. 
COMMENT The results of parts (b) and (c) can be illustrated as follows. 
(b) 


Here we have taken n — 10 (Le. r = 4), and shown how the 8 triangles of 
dots, plus the dot in the middle, form a square with 9 (ie. 2r + 1) dots along 
each side. 


(c) 


Here we have taken r — 6 and shown how the two triangles give a square 
having 7 (ie. r 4- 1) dots along each side. Bi 


1.4 The Division Algorithm 


This section is devoted to the Division Algorithm. This result tells us that we 
can divide one integer by another to obtain a quotient and a remainder; for 
example, we can divide 23 by 4 to obtain a quotient of 5 and a remainder of 3 
or rather more formally 23 = 5.4 + 3. But more than this, it tells us that the 
quotient and remainder are unique; for the example given above this means that 
if 23 = q-4 + r where 0 < r < 4, then q — 5 and r — 3. This familiar result, 
which you probably learned at a very early age, is of enormous theoretical 
importance and in the remaining sections of the unit we shall develop certain 
consequences of it which are fundamental to our treatment of the theory of 


numbers. 
READ B Section 2.1, pages 20-22. | 


NOTE 


EXAMPLE 3 


SAQ 8 
SAQ 9 
SAQ 10 
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Page 20, line 14 

As the result itself is so familiar, it may come as something of a shock to 
discover how complicated the proof seems. You can get an intuitive grasp of the 
Division Algorithm by imagining all the multiples of the positive integer b 
stepped off along the real line. 


a 
ipi © 


qb (q--l)b 


tt aa 


-2b -b 0 b 2b 3b 


Whatever the value of the integer a it will lie between two adjacent marks. More 
precisely, 


qb € a « (q+ 1)b 
or 
0<a—qb<b. 


Once we put r = a — qb, ie. a= qb + r, this gives the desired result. In the 
proof the existence of such a (unique) q is verified by choosing the one which 
gives the smallest non-negative value for a — qb. A 


We shall show as an example that every sixth power of an integer is of the form 
7k or of the form 7k + 1. 


By the Division Algorithm, any integer can be expressed in the form 7k + r 
where r — 0, 1, 2, 3, 4, 5 or 6, so we proceed to investigate the sixth powers of 
such numbers. By the Binomial Theorem, 

(7k + r)8 = 7- (polynomial in k) + r° 


(since in the expansion of (a + b)®° every term contains a positive power of a, 
with the exception of the final term bô, and we are putting a = 7k and b = r). 


Therefore 
(Tk + 0)° = 7- Po(k) + 0° = 7-Po(k), 
(7k + 1)6 = 7-P,(k) + 19 =7-P,(k) +1, 
(Tk + 2)6 = 7- P4(K) + 26 = 7- P4(k) + 64 7- (P4(k) 4- 9) 1, 
(7k + 3)® = 7- P(k) + 36 =7-P3(k) +729 =7-(P3(k) + 104) + 1, 
(Tk + 46 =7-Py(k) + 4° =7-P,(k) + 4096 =7-(P,(k) + 585) + 1, 
(7k + 5)6 = 7- P4(k) + 5° =7-P5(k) + 15625 = 7 - (P; (k) + 2232) + 1, 
(Tk + 6)6 = 7- PS(k) + 6° = 7- Pe (k) + 46656 = 7-(P6(k) + 6665) + 1. 


Thus all the sixth powers are of the form 7k or 7k + 1. 


The calculations above were worthwhile, if only to illustrate how much labour is 
saved by some of the theorems we prove later. However, there is one 
simplification which is worth a mention. Any number of the form 7k + 6 is also 
of the form 7k — 1 (for a different value of k, of course) and 


(7k — 1)® = 7 - (polynomial in k) + (—1)8 
= 7- (polynomial in k) + 1 
and so we have avoided taking the sixth power of 6. Similar economies are 
obtained by noting that any number of the form 7k + 5 is also of the form 
7k — 2 and so on. 
B page 22, Problems 2.1, number 3. 
Prove that every fourth power of an integer is of the form 5k or 5k + 1. 


B page 23, Problems 2.1, number 6. (This is the version of the Division 
Algorithm which enabled us to make the simplification in Example 3 and in our 
Solution to SAQ 9.) 
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SOLUTION 
TO SAQ 8 


SOLUTION 
TO SAQ 9 


SOLUTION 
TO SAQ 10 


141222 > X. 


(a) The Division Algorithm tells us that any integer is representable in one of 
the four forms 4k, 4k + 1, 4k + 2 or 4k + 3 (the integer is a multiple of 4 
with a remainder 0, 1, 2 or 3). Of these, 4k = 2(2k) and 4k + 2 = 2(2k + 1) 
are both even, whilst 4k + 1 = 2(2k) + 1 and 4k + 3 = 2Qk + 1) 4 1 are 
both odd, i.e. are of the form 2n + 1. 


(b) Any integer is of one of the three forms 3k, 3k + 1 or 3k + 2. Squaring, we 
obtain 


(3k)? = 3(3k?) 
(3k + 1)? = 3(3k? + 2k) + 1 
Gk + 2)? = 3(3k? +4k +1) - 1 
so that any square is of the form 3n or 3n + 1. 
(c) Using the same technique as in (b), 
(3k)? = 9(3k9) 
(3k + 1? = 9(3k? + 3k? +k) - 1 
(3k + 2)? = 9(3k? + 6k? + 4k) +8 
which are of the required forms. $ 


Any integer is of the form 5k, 5k + 1, 5k + 2. 5k +3 or 5k + 4. As in 


Example 3, we can simplify the solution by taking the alternative forms 5k — 2 
for 5k + 3 and 5k — 1 for 5k + 4. 


(5k)* =5-P,(k) 
(5k+1)*=5-P,(k)+1* =5-P,(k) +1 
(5k + 2)* = 5-P(k) +24 =5-(P2(k)+3)+1 


(5k — 2)* = 5-P3(k) + (—2)* = 5-(P4(k) +3) +1 
Gk — 1)* = 5- Pak) + (—1)4 =5-P,(k) +1 
so the fourth power of any integer is of the form 5k or of the form 5k +1. W 


Following the hint in B, by the Division Algorithm there are unique q and r 
with 0 € r' < ||| where a = qb + r. If0 < r < l|b| we let r = ' and q= d so 
a — qb + r and 0 < r < J|b|. If, on the other hand, 4b] <r’ < |b], let 

r =r — |b| so —3|b| <r < 0. This gives a = q'b + r + |b|. Now if b > 0 then 
|b| = b and we get a = (q + 1) b + r but if b < 0 then |b| = —b and we get 

a = (q — 1) b + r. This is why B suggests we take q = q' + 1 if b > 0 and 

q= q — Vif b < 0, so that whatever the sign of b we obtain a = qb +r. 


Now combining the situations we get a = qb + r where —1|b| < r < 4b]. It 
remains to prove that q and r are unique, so suppose that in addition 

a= qb + r, where —3|b| <r,  2|b|. Following the proof of the Division 
Algorithm, the equations a = qb + r and a= q,b + r, give 


la — qıl lb] = |r, — rl. («) 
Now adding the inequalities —5|b| < — r < 3|b| and —3|b| <r, < 1|b| gives 
—|b| < r; — r < |b| or equivalently |r, — r| < |B]. Substituting this in (*) we get 
la — 4,||b| < |b| which leads to |q — q,| < 1 and as |q — q,| is a non-negative 


integer this implies that q = q, which in turn leads to r = r,, completing the 
proof of the uniqueness of q and r. Bi 
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NOTE 


SAQ 11 


SOLUTION 
TO SAQ 11 


M335 1.5 
1.5 The Greatest Common Divisor 


The greatest common divisor (or highest common factor) of two integers usually 
assumes a transient importance in school work; it then vanishes rarely to be 
met again. In the theory of numbers it is of vital significance. The results of this 
subsection are of crucial importance in this course, so make sure now that you 
understand them, and how to use them. 


The first reading passage is concerned with basic divisibility properties of 
integers. 


READ B from the beginning of Section 2.2 on page 23 as far as page 24, line — 8. 


The proofs of parts (1) to (5) of Theorem 2-2 are straightforward. By all means 
take up B's invitation to try them as an exercise, but here are the missing 
proofs: 


(1) For all a, 0 = a-0 so aļ0 (by Definition 2-1) 
and a=1-aso lla 
and a — a-1 so aja. 


(2) all if there is an integer c such that ac = 1. (From our knowledge of the 
integers it is apparent that a — c = +1. But for completeness we shall prove 
that fact using properties of inequalities.) Now if a is positive then so is c 
since their product is 1 (which is positive); therefore as 1 is the least positive 
integer we know that c > 1. We can now multiply this inequality by the 
positive integer a and obtain ac = a, ie. 1 = a, which means a = 1 since we 
assumed that a > 0. 


If a is negative a similar argument shows that a — c — —1. 


(3) alb implies b = ka, for some integer k. 
c|d implies d = Ic, for some integer l. 
S 


o bd = ka- lc = (kl)ac, whence ac\bd. 


(4) a|b implies b = ka, for some integer k. 
b|c implies c = Ib, for some integer l. 
So c = (Ik)a, whence alc. 


(5) a|b implies b = ka, for some integer k. 

bla implies a = Ib, for some integer l. 

So ab = (kl) ab, giving kl = 1. This occurs if and only if k = l = +1, as in 
(2, whence a = +b. A 


B page 29, Problems 2.2, number 2. 
(a) Since a|b there exists an integer k such that b = ak. But then bc = a(kc) 
shows that a|bc, by Definition 2-1. 


(b) Since aļb and alc there exist integers k and | such that b = ak and c= al. But 
then bc = a?(kl), showing that a?|bc. 


(c) If a|b then b = ka, for some integer k, and bc = k(ac) shows that ac|bc. On 
the other hand, if ac|bc then bc = I(ac) for some integer l. Since c z 0 it can 
be cancelled from the two sides of the equation giving b = la; that is, ajb. Bl 


We now encounter the greatest common divisor of two integers. 


[READ B from page 24, line —7 to page 27, line 3. | 
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EXAMPLE 4 


SAQ 12 


SOLUTION 
TO SAQ 12 


NOTE 


M335 1.5 


So now we have seen that for given integers a and b, not both zero, the set of 
all linear combinations of a and b is precisely the set of all multiples of gcd(a, b) 
and, in particular, the least positive member of this set is gcd(a, b) itself. 

To illustrate our results so far let us look at 

B page 30, Problems 2.2, number 13. 


Part (a) is straightforward. If there exist integers x and y such that c = ax + by 
then, by the Corollary, c is a multiple of gcd(a, b). That is, gcd(a, b)|c. On the 
other hand, if gcd (a, b)|c, then c = k- gcd (a, b) for some integer k, and since 
there are integers x, y such that ax 4- by = gcd (a, b) we have c = k: ged (a, b) 

= kx:a + ky: b. 


Part (b) is also a consequence of the Corollary. Let ax + by = gcd(a, b), where 
: n b. 7 
gcd(a, b) = d. Then, since dja, A is an integer, and similarly, q is an integer. 
d 


Dividing through the given equation by d gives 


f x4 PIRE 


d 
So 1, the least positive integer, is a linear combination of x and y and is 
therefore the greatest common divisor of x and y. 


Use the Corollary on B page 26 to solve B page 30, Problems 2.2, number 11. 


Since ax + by = (—a) (— x) + by, it follows that every linear combination of a 
and b is a linear combination of — a and b, and conversely. Therefore the 
smallest positive linear combination of a and b is the smallest positive linear 
combination of —a and b. By the Corollary, gcd(a, b) = gcd(— a, b). 


The other two parts follow similarly from observing that 


ax + by = ax + (—b)(—y) = (—a)(—x) + (—b)(-y). M 


The final reading passage of the section focusses on pairs of integers whose 
greatest common divisor is 1, and leads to further important divisibility 
properties. 


READ B from page 27, line 5 to the end of Section 2.2. 


Page 29, line 4 

There are many ‘algebraic systems which have a natural concept of division. 
For instance, in the set of all polynomials with integer coefficients we say 
(3x? — 2x + 4)|(3x3 + x? + 2x 4 4) since 


3x? + x? + 2x + 4 = (x + 1) (3x? — 2x 4 4). 


Or, in the Gaussian integers (that is, the set {a + ib: a,b are integers, i? = — 1}) 
we have (1 + 2i)|(6 + 2i) since 6 + 2i = (1 + 2i) (2 — 2i). 


But in neither of these systems can we generalize B's Definition 2-2, since in 
neither system do we have a notion of 'greatest. Which is the bigger of 2x + 1 
and 3x — 5? Which is the bigger of 3 + 2i and 2 — 3i? However, properties (1) 
and (2) of Theorem 2-6 make sense in each of these systems, and so can be 
taken as a definition of greatest common divisor. (A minor problem ensues, 
because the greatest common divisor so defined is not unique—but that's 
another story!) A 


As an illustration of the use of Theorem 2-4 let us look at 
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AMPLE 5 B page 30, Problems 2.2, number 16, parts (a) and (c). 


SAQ 13 


SOLUTION 
TO SAQ 13 


1.6 


XAMPLE 6 


(a) To show that gcd(a, bc) = 1, Theorem 2-4 tells us that it suffices to find 
integers r and s such that ar + (bc)s = 1. Now we are given that 
gcd(a, b) = ged(a, c) = 1 and so, by Theorem 2-4 again, we know that there 
are integers x, y, u and v such that ax + by = 1 and au + cv = 1. Therefore, 
multiplying these equations together, 


1 = (ax + by) (au + cv) 
= a(aux + cox + byu) + be (yv) 


so that 1 is a linear combination of a and bc, as required. 


(c 


A 


To establish that gcd(ac, b) = gcd(c, b) we shall show that 
gcd (ac, b) < gcd (c, b), and that ged (c, b) < ged (ac, b). 


Given that gcd(a, b) = 1 we have ax + by = 1 for some integers x and v. 
Multiplying by c, 


(ac)x + b(yc) = c, 


shows that c is a linear combination of ac and b and hence, by the 
Corollary to Theorem 2-3, a multiple of gcd(ac, b). That is, gcd(ac, b)|c. But 
gcd(ac, b)|b, by the definition of greatest common divisor, and so being a 
common divisor of b and c, gcd(ac, b) X gcd(b, c) 


On the other hand, gcd(b, c) divides c (and therefore ac) and b, and so is a 
common divisor of ac and b. That is, gcd(b. c) < ged(ac, b). 
Use the result of Theorem 2-4 to prove the following. 
(i) If ged(a,b) = 1 and cla, then gcd(b, c) = 1. 
(ii) If ged(a, b) = 1 then ged(a,a + b) = 1. 
(i) Since ged (a, b) = 1 we can write ax + by = 1, for some integers x and y. But 
c|a, and so a — kc, for some integer k. Combining these two equations gives 
c(kx) + by = 1, 
so that ged (b, c) = 1, by Theorem 2-4. 


(ii) Again gcd(a, b) = 1 and so ax + by = 1. This equation can be rearranged 
as a(x — y) + (a + b)y = 1, which expresses 1 as a linear combination of a 
and a + b, as required. Bl 


The Euclidean Algorithm 


In the previous section we discovered a number of properties of the greatest 
common divisor, but so far we have developed no techniques for actually 
calculating the greatest common divisor of two given integers. If all divisors of 
the two integers are easily calculated then inspection will suffice, but for 
handling larger integers we need an alternative approach. 


Suppose we wish to calculate gcd (1729, 703). Applying the Division Algorithm 
(dividing 1729 by 703), we find that 1729 = 2-703 + 323. From this equation we 
see that any common divisor of 703 and 1729 also divides 323. On the other 
hand, any common divisor of 323 and 703 also divides 1729. It therefore follows 
that the set of all common divisors of 1729 and 703 is the same as the set of all 
common divisors of 703 and 323; in particular, gcd(1729, 703) = gcd(703, 323). 


E 


SAQ 14 


SOLUTION 
TO SAQ 14 


SAQ 15 


SOLUTION 
TO SAQ 15 


This has considerably reduced the size of the integers in our original problem, 
So it seems a good idea to repeat the procedure to reduce them still further. 
Let’s see what happens. 


1729 = 2-703 +323 ged(1729, 703) = ged(703, 323) 
703 = 2-323+57 ged(703,323) = ged(323, 57) 
323 =5-57 +38 ged(323,57) = gcd(57,38) 
57 =138 +19 ged(57,38) = ged(38, 19) 
38 2249 +0 gcd(38,19)  — 19 


To get from one line to the next in the above, we have applied the Division 
Algorithm to the previous divisor divided by the previous remainder. The last 
non-zero remainder is the greatest common divisor that we were looking for. 


Apply the above procedure to find gcd(306, 657). 


657 = 2-306 + 45 


306 = 6-45 + 36 
45 =136+49 
36 =49+0 


gcd(657,306)=9 gi 


This method of obtaining greatest common divisors, by repeated application of 
the Division Algorithm, is known as the Euclidean Algorithm. The next reading 
section proves that the Euclidean Algorithm does produce greatest common 
divisors and sets up the machinery for finding integer solutions x and y to 
equations of the form ax + by = c. 


READ B from the beginning of Section 2.3 on page 31 to page 35, line 6. 


B page 37, Problems 2.3, number 2, parts (a) and (d). 


(a) 72 2 1-56 + 16 (d) 2378 = 1-1769 + 609 
56—3-16 +8 1769 = 2-609 + 551 
16=2-8+0 609 — 1:551 +58 
<. gcd(56,72) = 8 551 =9-58 + 29 
Reversing these steps, 58 =2-:29+0 
8 = 56—3-16 -. gcd (1769, 2378) = 29 
8 = 56 — 3(72 — 1-56) 29 = 551 — 9-58 
8 = 4-56 + (—3)-72 29 = 551 — 9(609 — 1-551) 


29 = (—9)-609 + 10(1769 — 2-609) 
29 = 10-1769 — 29(2378 — 1-1769) 
29 = 39-1769 + (—29)-2378 W 


The remainder of B Section 2.3 introduces the closely allied concept of the least 
common multiple of two integers. 


READ B from page 35, line 7 to the end of Section 2.3. 


SAQ 16 


SOLUTION 
TO SAQ 16 


1.7 


SAQ 17 


SOLUTION 
TO SAQ 17 
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Find lcm(306, 657). 


Since gcd(306, 657) = 9 by SAQ 14, Icm(306, 657) = RES = 22338. W 


The Linear Diophantine Equation 


The term Diophantine equation is used to signify an algebraic equation in one 
or (usually) more variables which is to be solved in the set of integers. As this 
course progresses, we shall encounter numerous Diophantine equations, and 
Unit 8 is devoted to looking at some of the more famous problems in this area. 
In this section we introduce the simplest non-trivial Diophantine equation: the 
linear equation in two variables. Its presence is appropriate at this point 
because of the role played in its solution by the greatest common divisor of the 
coefficients of the two variables. However, it should be mentioned that the 
solution of the linear Diophantine equation that we obtain in Theorem 2-9 is of 
more theoretical importance than of practical use. We shall later find other 
methods of solving these equations. 


READ B from the beginning of Section 2.4 on page 38, to the foot of page 39. 


Which of the following Diophantine equations have solutions? (We are not 
asking you to solve them at this stage.) 


(i) | 54x + 21y = 906 

(ii) 45x + 105y = 200 

(ii) 201x — 101y = 1 

We test whether the greatest common divisor of the coefficients of x and y 
divides the right-hand side. 

(i) ged(54, 21) = 3- gcd(18, 7) = 3, and 3/906 so the equation has solutions. 


(i) gcd(45,105) = 3 - gcd(15,35) = 15- gcd(3, 7) = 15, and 15/200 so the 
equation has no solutions. 


(iii) From the Euclidean Algorithm: 
201 = 1-101 + 100 
101 = 1400 + 1 
100 = 100-1 


We see that gcd(— 101,201) = gcd(101,201) =1, and the equation has 
solutions. BN 


We have seen that the equation ax + by = c has solutions if and only if djc, 
where d = gcd(a, b). Suppose we have a solvable equation in which d = 2. In 
this case, the coefficients a, b and c have a common divisor d which could be 
cancelled from the two sides of the equation. For instance, on page 39 B 
considers the equation 3x + 6y = 18, where gcd (3,6) = 3, which divides 18. 
Dividing by 3, we could consider instead the equation x + 2y = 6, which has the 
same solution set. 


An alternative way of looking at the solvability of the equation ax + by = c is to 
cancel out all common factors of a, b and c to obtain an equation a'x + b'y = c 
which has the same solution set. This equation has a solution if and only if 
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SAQ 18 
SAQ 19 
SAQ 20 


SOLUTION 
TO SAQ 18 


SOLUTION 
TO SAQ 19 


gcd(d', b^) divides c^, but, as all common factors have been cancelled, no non- 
trivial divisor of a’ and b’ can divide c'. So for solutions we must have 
gcd(a’,b’) = 1. 


In the next reading passage we extend the result of the solvability of 

ax + by = c to show how, from any one solution, all other solutions can be 
obtained. In Theorem 2-9, B continues to look at the case where gcd (a,b,c) = d 
may exceed 1 (which will be needed in Unit 3): he summarizes the important 
case d — 1 in the Corollary on page 42. 


READ B from the top of page 40 to the end of Section 2.4. 


B page 43, Problems 2.4, number 2, parts (b) and (c). 


Find an integer solution of 201x — 101y = 1 which satisfies 200 < x < 300. 


If I spend exactly £10 to buy some 8p and some 10p stamps so that the 
difference in number between the two kinds of stamps bought is as small as 
possible, how many stamps have I bought? 


(b) 54x + 21y = 906 
We have seen in SAQ 17 that gcd(54, 21) = 3, and as 3|906, the equation has 
solutions. We first (optionally) divide through by 3, obtaining the equation 
18x + 7y = 302. Reversing the steps in the Euclidean Algorithm we next 
express gcd (18,7) = 1 as a linear combination of 18 and 7. 


1=4-3 


18=2-7+4 

=4-(7-4)=2:4-7 
7—1:4435 — 

—-2-:(18—2-7)—-7 
4=1:3+1 ( ) 


=2.-18 + (—5).7 
Now multiplying by 302 gives 
302 = 604-18 + (—1510)-7 
so that x, = 604, y = — 1510 is one solution. The other solutions are given by 
{x = 604 + 7t, y = —1510 — 18t: t any integer}. 
We require that x and y are both positive. 
For x > 0, 604 + 7t > 0. That is, t > —86. 
For y > 0, — 1510 — 18t > 0. That is, t < — 84. 
There are therefore three solutions in positive integers: 
(i) t= —86, giving x = 2, y = 38; 
(ii) t= —85, giving x = 9, y = 20; 
(iii) t = —84, giving x = 16, y = 2. 


(c) 123x + 360y = 99 
This equation clearly has no positive solutions. For if x > 1 and y 21 then 
the left-hand side has least value 483. Bg 

201x — 101y 21 


We have already seen that this equation has solutions and, since 
2-101 — 201 = 1, it is not unreasonable to ‘spot’ a first solution 
201 (—1) — 101 (—2) = 1. By Theorem 2-9 the general solution is given by 


{x 1—101t,y = —2 — 201t: t any integer}. 
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TO SAQ 20 
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ROBLEM 1 


ROBLEM 2 


ROBLEM 3 
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In order to satisfy the condition 200 < x < 300 we require 


200 < —1 — 101: < 300 


201 « —101t « 301 
which is clearly satisfied only by t — —2. So the answer is 
x= —14- 101-2 — 201 
y= —2 + 201-2 = 400. m 


Let x be the number of 8p stamps and y the number of 10p stamps. Then 
8x + 10y = 1000. Since gcd(8, 10) = 2 and 2|1000, we divide by 2 obtaining 
the equivalent equation 4x + 5y = 500, known to have solutions. 


Indeed, some solutions are easily spotted, for instance x = 0 and y = 100. Using 
this particular solution, Theorem 2-9 gives us the general solution: 


{x =0 + 5t, y = 100 — 4t:t any integer}. 
Now we wish to minimize |x — y|, ie. [9t — 100|. This is done by choosing t to be 


à 100 s "M 
the nearest integer to ^ namely t — 11. The corresponding solution is 


x= 55, y = 56, 
and the total number of stamps bought is 111. B, 


Problem Solving 


We round off this unit with a short section on problem solving. The problems 
listed below are, by and large, more difficult than corresponding SAQs in the 
text and are different in style, being investigatory. Three of the problems ask 
you to discover the result in question before attempting to prove-it. The section 
is accompanied by a tape cassette in which we discuss the problems and give 
you guidance on how to go about solving them. 


The sole objective of this, and a similar sections in Unit 3, is to help you 
develop skill in problem solving. 


If you feel confident that you can solve the problems below without the help of 
the tape then fire away. You will find formal solutions to these problems at the 
end of the Exercise Booklet. Otherwise: 


START THE TAPE WHEN YOU ARE READY 


Investigate values of n! and n° for small values of n. Make a conjecture about 
the relative sizes of n! and n° and prove your result. 


(a) Show that 2 divides n° — n for all integers n. 


(b) Investigate n? — n for small values of n and make a conjecture about the 
largest integer which divides n° — n for all integers n. 


(c) Prove your conjecture. 

(a) Find all integer solutions of the simultaneous system of equations below. 
3x + Ty + 9z = 300 
2x + 3y + 5z = 167 


(b) For which of these solutions are x, y and z all positive? 
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PROBLEM 4 
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(a) Why does the Diophantine equation 
6x + 15y — 18z = 100 
have no solutions? 
(b) What condition will ensure that the Diophantine equation 
ax + by+cz=k 
has no solutions? Prove your result. 


(c) (Much harder.) What conditions will ensure that the equation in (b) has 
integer solutions? Prove your result. 


Summary 


In this unit we have covered Chapters 1 and 2 of B. Chapter 1, the historical 
section apart, revised two familiar ideas; proof by mathematical induction in its 
various forms, and the Binomial Theorem, whose coefficients we shall encounter 
from time to time in the course. 


The first three sections of Chapter 2 form the real foundation of the whole of 
this course. Many of the ideas that we met there—the Division and Euclidean 
Algorithms, greatest common divisor and least common multiple—were quite 
possibly not new to you, but the simple divisibility properties of the integers 
that they entail are central to the whole of the course, and these ideas must be 
grasped now. 


The final section of Chapter 2 investigated the linear Diophantine equation and 
showed how, when the equation is solvable, to reach a solution by means of the 
Euclidean Algorithm. 


The main definitions and results encountered in this unit are listed in the 
corresponding entry in the Handbook. 
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UNIT = 
PRIMES AND 
THEIR DISTRIBUTION 


2.0 


2.1 


NOTES 


Introduction 


This unit comprises Chapter 3 and the first section of Chapter 13 of B. The unit 
is mainly concerned with properties of the prime numbers: 


2,315, 7; 11, 13; 17; 19523599 6 3 on 5 


that is, those positive integers which have exactly two positive divisors; the 
integer itself and 1. The major result of the unit, the Fundamental Theorem of 
Arithmetic, tells us that every integer greater than 1 can be written as a product 
of primes and that this product is unique apart from the order in which the 
primes are written. For example, 12 may be written as 3-2-2 or 2-3-2, or 
2-2-3, but no other factorizations of 12 into a product of primes are possible. 


The fact that the primes are the ‘building blocks’ from which all natural 
numbers are made certainly accounts for their importance. It does not, however, 
account for the fascination which they have exerted—and continue to exert. 
Many of the most famous problems in mathematics, both solved and unsolved, 
involve prime numbers and their distribution. How many primes are there? Is 
there a largest prime? Is there a formula for generating all prime numbers? 
How many primes p are there so that p + 2 is also prime? Can every even integer 
greater than 2 be written as the sum of two primes? This unit will provide 
answers to some of these questions and will discuss others which, despite 
considerable numerical evidence to support their validity, remain unsolved 
despite years of continual assault upon them. 


In Section 2.4 we introduce and discuss what is perhaps the most intriguing 
sequence of integers in mathematics—the Fibonacci sequence. As well as being 
of interest for its own sake, the Fibonacci sequence will enable us to gain 
practice in using results and techniques developed elswhere in the first two 
units; in particular, divisibility properties which, by this stage, include the 
divisibility properties for primes. 


The Fundamental Theorem of Arithmetic 


Tn this section we introduce the primes and give a proof of the fact that every 
integer greater than 1 can be expressed as a product of primes in a unique way 
(apart from their order). 


READ B Section 3.1, page 46, as far as page 47, line —4. 


(i) Page 46, line 14 
Many students are reluctant to abandon the mistaken belief that 1 is a 
prime. But, as B says, we do not classify 1 as either prime or composite. 
For, if we classify it as prime then we can express integers as products of 
primes in many different ways; for instance, 


6=2-3=1-2-3=1-2-3-1 


and the Fundamental Theorem of Arithmetic is lost. On the other hand, 
we wish to reserve the description composite for those positive integers 
which can be written as a product of two or more primes. 


(ii) Page 47, line 6 
If the concept of a prime is not new to you, you may feel that Theorem 3-1 
and its Corollaries are so obvious that they hardly need proving. However 
fundamental primes may be, the theorems about them do not spring full- 
grown out of Definition 3-1 alone. Here for instance, following the logical 
development of B, we call upon Euclid’s Lemma which was proved 
previously without reference to primes. A 
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To illustrate the use of these results let us look at 
Let p be a prime and a and b be integers. Prove 
(a) if pla” then pla"; 
(b) if gcd(a, b) = p then gcd(a", b") = p". 


(a) Using Corollary 1 with a, = a, = ++- = a,, we deduce immediately that if 
pia" then pla. So writing a = pr, for some integer r, and raising both sides of 
this equation to power n gives a" = p"r", which tells us that p"|a". 


(b) As gcd(a, b) = p, there exist integers r and s such that a = pr and b = ps, 
with gcd(r, s) = 1. But then 


ged(a", b^) = ged(p"r", p's") 
= p' gcd(r", s") (by Theorem 2-7, B page 34) 
and it remains to show that gcd(r", s") = 1. 


Suppose q|gcd(r", s"), where q is a prime. Then q|r" and q|s". As we saw in 
part (a), this implies that q|r and q|s, so that q|gcd(r, s). But ged(r, s) = 1, 
and so no such prime q can exist, and gcd(r", s") = 1. 


B page 51, Problems 3.1, number 7. 


Given that p, and p, are distinct primes such that their product p, p, divides a", 
prove that (p,p;)"a". 


By Corollary 1, any prime dividing 50! must divide one of the numbers in the 
sequence 1, 2, 3,...... , 49, 50, and hence cannot exceed 50. On the other hand, 
each prime not exceeding 50 occurs in this sequence and so certainly divides 50! 
The question therefore boils down to finding all the primes not exceeding 50. 
These are 


2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, 37, 41, 43, 47. 


(A list of primes less than 10,000 is given in Tables 2 and 3 in the Appendices of 
B, and a systematic method of finding primes is described in Section 2.2 of this 
unit) Bi 


Since p,p;|a" we know that p,|a" and p,|a". As in part (a) of Example 1, this 
gives p,|a and p,|a. Now gcd(p,,p,) = 1. Therefore, appealing to Theorem 2.4, 
Corollary 2, p,p;|a. Finally, writing a = p,p;b gives a" = (p,p,)"b", and so 
(p,p2)"|a", as required. Bi 


READ B from page 47, line —3 to page 50, line 5. 


Page 49, line —8 

The literal meaning of the word canonical is standard (or regular). In 
mathematics, when some quantity or formula is capable of being expressed in a 
variety of ways it is often convenient to choose a particular presentation as the 
standard one, i.e. the canonical one. The canonical form for expressing an 
integer greater than 1 as a product of primes is the one in which the prime 
factors are placed in ascending order and multiple occurrences of the same 
prime are collected together and written as a power. 


To illustrate the canonical form let us look at 


B page 52, Problems 3.1, number 15. 


Firstly let us suppose that a > 1 is a square, say a = b?, where since a > 1 we 
may choose b > 1. Let the canonical form for b be 


ki 


b = pi pe --- pk. 
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Then 


a=b = (pfr pir)? = pi^ pj -p 


which is in canonical form (the primes are in ascending order with multiple 
occurrences collected together), with all the exponents even. 


On the other hand, if all the exponents in a are even then 
a = py" pj? -p 
= ppp) 

is certainly a square. 


B page 51, Problems 3.1, number 12. 


Prove that an integer a > 1 is an nth power if and only if in the canonical form 
of a all the exponents are divisible by n. 


B page 50, Problems 3.1, number 3. 


1234 = 2-617 

10140 = 22.3.5.13? 

36000 = 25.32.53, E 

Let a = b" where b = pl! pl? ..- p* is in canonical form. 


Then 


ko, 


P? 


nk; 


a — b" — (p epey- pik pe -p 


a 

has each exponent divisible by n. 

On the other hand, if a has canonical form with each exponent divisible by n, then 
a= pi pa? pk (ppp 

is certainly an nth power. W 


(a) Let p = 3n + 1. As 2 is not of the form 3n + 1, p is an odd prime. Hence 
p — 1 = 3n is even, and so 2|3n, from which it follows that 2|n. Writing 
n = 2m gives p = 6m + 1 as required. 


(b) Let 3n + 2 = pi p? ... p!" be its canonical form, so that the primes dividing 
3n + 2 are Pi, p,,..., p, where r > 1. Now 3n + 2 is not divisible by 3 and so 
none of p; is 3. Therefore each prime divisor of 3n + 2 is of one of the forms 
3k +1 or 3k 4 2. 


The identity 
(Sr + 1)(3s + 1) = 3(3rs +r s) - 1 


shows that the product of two numbers of the form 3k + 1 is of the same 
form. It follows, by induction, that any product of numbers of the form 

3k + 1 is itself of that same form. Hence, if each pi is of the form 3k + 1 then 
so too is 3n + 2, which is not the case. We conclude that at least one of the 
prime factors p; of 3n + 2 is of the form 3k + 2. 


SA 


(c) Using the hint, 


m—12(n—1)(P +n+1) 
expresses the prime n? — 1 as a product of two positive factors. The 
uniqueness of factorization tells us that this is possible only if one of the 
factors on the right-hand side is equal to 1. Now n > 1 (since n? — 1 is a 


prime), and so n? +n + 1 > 3. The only remaining possibility is that 
n—1= 1, ie. n= 2, giving  —1- 7. 
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(d) Suppose 3p + 1 is a square, say 3p + 1 = r?. Then 
3p =n? —1=(n—1)(n4+1). 


As 3p + 1 = 7 we know that n > 2 so that neither factor on the right of the 
above equation is equal to 1. By uniqueness of factorization we therefore 
have either 3 = n — 1 and p=n+1, or 3 =n+1 and p =n — 1. These give 
respectively n — 4, p — 5 and n — 2, p — 1, and the first case is the only 
possibility BH 


If the prime factorization of an integer is given, it is a simple matter to obtain 
all divisors of that integer. For example, spend a minute or two thinking about 
the prime factorizations of divisors of the integer 252 = 2? - 3? -7 and, without 
calculating them, see if you can formulate a conjecture about the set of all 
divisors of 252. 


You should quickly appreciate that if d is a divisor of 252 then d can involve 
only the primes 2, 3 and 7. What is more, since 23252, 33252 and 77252, the 
exponent in d of each of these primes cannot exceed the corresponding exponent 
in 252. From these facts we draw the conclusion that the divisors of 252 are the 
integers 2” - 35. 7 where r = 0, 1 or 2, s = 0, 1 or 2 and t= O or 1. These 18 
integers (3 choices for r, 3 choices for s and 2 choices for t give a total of 3-3-2 
choices, and by uniqueness of factorization the resulting numbers are distinct) 
are the only divisors of 252. Notice that they include 1 (r = s = t = 0) and 252 
itself (r = 5 = 2, t— 1). 


Let us turn to the general situation. Let 
a= peek 


be an integer greater than 1 given in canonical form. Extending the observations 
of the above example we claim that the set of all divisors of a is the set of 
integers 


(ptp?---p':0s l; < k; fori ^ 1,2,...,r). 
Certainly each integer in this set is a divisor of a, since 
a= pit pee pe = (pi p3 ppp peo? pio 


and so our task now is to show that these, which are distinct by unique 
factorization, are the only divisors of a. 


Let d be any divisor of a and let p be a prime factor of d. Since p|d and dļa we 
know that p|a and so, by Theorem 3-1, Corollary 2, p — p; for some i in the 
range 1 < i <r. As the possible prime factors of d are p1, p5,..., p, it now 
follows that we can write d = p p? +: pr, where each l; is a non-negative 
integer. 


Finally, to show |, € k; we utilize the fact that dļa and in particular that p'|a. As 
the primes p,, p>,...,p, are distinct, p,J/p? p* - -- pt”, for otherwise we contradict 
Theorem 3-1, Corollary 2. It follows that ged (p'?, p? p * pF) = 1. Applying Euclid’s 
Lemma, p} |p% (p pf --- pk) gives p'i|pf*, which requires |, < kı. The same 
argument can be used to establish |, € k;,..., 1, X k, as required. 
(a) Let a= 33.5.17? and b = 2.3? -7 17*. 

(i) How many positive divisors does a have? 

(i) List the common divisors of a and b. 

(ii) Determine gcd(a, b) and lcm(a, b). 


(b) In general, if a and b are integers whose prime factorizations are given, 
how do you obtain gcd(a, b) and lem(a, b)? Prove your result. 
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M33> 2.1 
SOLUTION (a) (i) The divisors of a are the integers 3"-5°-17' where 0 <r < 3055S TT 
TO SAQ 6 and 0 < t € 2, of which there are 4-2-3 = 24 
(ii) The common divisors of a and b are 
1, 3, 32, 17, 3-17, 33-17, 17, 3-17? and 32.177, 
(ii) ged(a, b) = 37-17? 


ab 
gcd (a, b) 


8245517252..3*. 7.174 
82.177 


203? 509 17%, 


lem(a, b) = 


(b) Let p,,p2,...,p, be the primes which divide either a or b. Then we write 
a= pi p? -př and b= pit pe... pr 
where exponents may be zero. For example, k, = 0 if pilb but pJa. 
A divisor of a is an integer 
d-—pypyep 
where 0 < o; < k; foreach i = 1,2,...,r. For dalso to divide b we require a; < l, 
foreach i = 1,2,...,r. Combining these two conditions, d is a common divisor of 
a and b if and only if 0 < o; < min {k,, l;} (the smaller of k; and 1;) for each 
i= 1,2,...,r. Therefore, 


gcd(a, b) = prin s, 1) pymin{ka, Ll m . prints rl 


In words, we obtain the prime factorization of gcd (a. b) by raising each 
prime to the power which is the smaller of its exponents in a and b. 


ab 
lem(a, b) = ———— 
gcd(a, b) 
kı pk ERES 5 
Pip? pe ppp 
F pinta D) . . printet} 
= pitti- minka) — Bemis: 


For any two numbers k and l, k + 1 — min {k,l} = max {k, l} (the larger of k 
and l). Therefore 


lcm (a, b) = prraxtkast} pmaxtk2,t2} $a praxis) 


In words, we obtain the prime factorization of lcm (a,b) by raising each 
prime to the power which is the larger of its exponents in a and b. 


As a further illustration of these results, if a = 33.53. 11.19 and 
b — 2-3? 7-11? then 


gcd(a, b) = 29.32.59. 79.111.199 = 32.11 
(where for each prime occurring in a or b we take the smaller exponent) and 
lem(a, b) = 2-33 .53 .7.11?.19 


(where the larger exponent is taken) I 


READ B from page 50, line 6 to the end of Section 3.1. 


There are many different forms that the proof of Theorem 3-3 can take. The 
usual elementary proof observes that if a? = 2b? then a? is even which means 
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that a must be even (as the square of an odd integer is odd). Replacing a by 2c 
gives 2c? — b?, and the same argument now tells us that b is even. So2 isa 
common factor of a and b, contradicting the assumption that gcd(a, b) — 1. 


The advantage of the proof given in B is its generality. For if N is a positive 
integer which is not a perfect square, the same proof (replacing 2 by N as 
appropriate) shows that JN is irrational. 


The uniqueness of prime factorization provides us with a third proof. Suppose 
a? = 2b?. We know from SAQ 4 that each prime exponent in a? and in b? 

is even. Focussing upon.the prime 2, its exponent in a? is even whilst its 
exponent in 2b? is odd, a contradiction. Hence a? = 2b”, (ie. a/b = J is 
impossible. 


Prove that if the integer c > 1 is not the nth power of an integer then vc, the 
nth root of c, is not a rational number. 


Suppose that / c is the rational number a/b. Then a" — cb". By SAQ 4 we 
know that each prime divisor of a" and b” has exponent divisible by n and that 
c (since it is not an nth power) has at least one prime divisor, say p, whose 
exponent is not divisible by n. So the exponent of p in a" is divisible by n whilst 
the exponent of p in cb" is not divisible by n. This contradicts the uniqueness of 
prime factorization. W 


The Sieve of Eratosthenes 


As we have remarked, Tables 2 and 3 at the end of B give a complete list of 
primes less than 10,000. How would one go about compiling such a table? In 
the solution to SAQ 1 you used all the primes less than 50 and maybe you 
found them by applying the definition, taking each integer in turn and trying to 
find factors by dividing by all smaller positive integers. If so, you will realize 
that such a crude method would be very time consuming. 


Fortunately, there is a rather more efficient way of testing for primality and for 
producing lists of primes, and both techniques are discussed in this section. 
Finally, we settle one of the problems posed in the introduction to the unit by 
proving that there are infinitely many primes. 


READ B Section 3.2 from page 52 as far as page 54, line — 12. 


B page 57, Problems 3.2, number 1. 
B page 57, Problems 3.2, number 2. 
B page 57, Problems 3.2, number 3. 
As 26? < 701 < 27’, we test 701 for divisibility by the primes not exceeding 26. 


These are 2, 3, 5, 7, 11, 13, 17, 19 and 23, and as none of them divide 701 the 
latter is a prime. 


Similarly, 31? < 1009 < 32?, and as 1009 is not divisible by any prime X31 it is 
itself prime. WM 


Since 14 < ./200 < 15, removing from the set of integers from 100 to 200 all the 
multiples of 2, 3, 5, 7, 11 and 13 leaves just the primes behind. These are: 


101, 103, 107, 109, 113, 127, 131, 137, 139, 149, 151, 
157, 163, 167, 173, 179, 181, 191, 193, 197, 199: M 
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We have seen that a composite integer n must have a divisor other than 1 which 


does not exceed Jn. This problem is a simple extension of this result to integers 
having three (or more) non-trivial divisors. 


Suppose n = abc where a > 1, b > 1 and c > 1. Now if a > n, b > Jn and 
c > \Yn their product abc exceeds n, which is a contradiction. Hence at least one 


of these divisors must be less than or equal to In, and the divisor in question 
has a prime divisor with the same property. 


So, if pyn for all primes p € Jn, then n cannot be expressed as the product of 
three factors each exceeding 1, which means that n is the product of at most two 
(not necessarily distinct) primes. BN 


READ B from page 54, line — 11 to the end of Section 3.2. 


Page 56, line 17 

In Theorem 3-5 and its proof, use is made of indices which are themselves 
powers, for instance 2?" '. Strictly speaking, such expressions ought to 
include brackets, since a” is not in general equal to (a^). For example, 


209.2281 par Q3y = 8* = 212. 


(i 


= 


By convention, both in B and elsewhere, an unbracketed expression a’ is 
always to be interpreted as a^^. 


(ii) Page 56, line — 11 
Observe that the proof uses the Second Principle of Mathematical 
Induction, as the induction hypothesis assumes that p, < 2?‘"' for each 
integer k from k= 1 to k — n. 


A considerably better estimate of the size of p, than that obtained in 
Theorem 3-5 is p, < 2". One method of proving this result is by using 
Bertrand's Conjecture, which states that for every positive integer n > 1 
there is a prime p satisfying the inequality n < p < 2n. This conjecture was 
proved by Chebyshef in 1850, and although the proof is not within the 
scope of this course it is instructive to see how the estimate p, X 2" is 
derived from it. 


Assuming Bertrand's Conjecture, use induction to show that the nth prime Pn 
satisfies p, < 2”. 


B page 57, Problems 3.2, number 7. 


Since p, = 2 < 2!, the result p, < 2" is true when n = 1. Assume the result is 
true when n = k > 1, ie. p, X 2*. By Bertrand's Conjecture there is a prime p 
satisfying 2 < p < 2.2* = 2**'. By hypothesis, there are at least k primes not 
exceeding 2* so with the existence of p there must be at least k + 1 primes not 
exceeding 2**!. That is, p,,, < 2**!, completing the induction step. By the first 
Principle of Finite Induction P, <2" foralln=>1. E 


Suppose there are only finitely many primes and let p be the largest one. 
Consider N = p! + 1 which, being greater than p, cannot be prime and so has a 
prime factor, q say. As p is the largest prime, q < p and so q|p! But g|N and 
therefore g|N — p! That is, q|1, an obvious contradiction. W 
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The Goldbach Conjecture 


Having seen that there are infinitely many prime numbers, this section looks at 
the distribution of primes within the positive integers. There appears to be no 
regularity about the occurrence of primes, and having arrived at one, there is no 
way of predicting how far we must go to reach the next. Nevertheless, we shall 
see that the gaps between successive primes become arbitrarily large; there is a 
gap of 6 between 23 and 29, a gap of 8 between 89 and 97, and so on. On the 
other hand, the minimal gap of 2 between odd primes, for pairs like 11, 13 and 
41, 43, occurs very frequently. The conjecture that there exist infinitely many 
such pairs of primes is one of the outstanding unsolved problems of number 
theory. 


So what questions can we reasonably ask about occurrences of primes? If you 
look at the arithmetic progression 


5, 11, 17, 23, 29, 35, 41, 47, 53, 59, 65, This 


you will no doubt be struck by the preponderance of primes. However, no 
arithmetic progression 
a, à + b, a + 2b, a + 3b,..., a=1,b>0 


can consist entirely of primes since, as we said earlier, the gap between 
successive primes becomes arbitrarily large, and once we have a gap of length b, 
ie. b successive composite integers, the given arithmetic progression must 
contain one of these composites. 


In fact every arithmetic progression, like the one above, with gcd (a, b) — 1, 
contains an infinite number of primes. This classical result of number theory is 
due to Dirichlet and, as its proof is beyond the scope of this course, we shall 
restrict ourselves in this section to proving two special cases of it. Later in the 
course, when we have more machinery at our disposal, we shall find that we can 
prove several more particular instances of Dirichlet's Theorem. 


En route to looking at arithmetic progressions, the section interposes a 
discussion of Goldbach's Conjecture. Like so many other problems involving 
primes, this famous conjecture has overwhelming numerical evidence for its 
validity but still awaits a proof. 


READ B Section 3.3 from page 58 as far as page 60, line 11. 


Page 59, line —3 
It is usual to denote by z(n) the number of primes in the set (1.2..... n! so that, 


In the language used here we could therefore say that ‘almost all’ positive 
integers are composite. Landau might have observed that 0% of all positive 
integers are prime, despite there being infinitely many of them! A 


B page 64, Problems 3.3, number 2. 


(a) pp +2) c 1— (p +17. 


(b) Any integer is of one of the forms 6k + r, where r = 0, 1, 2, 3, 4 or 5. When 
r is even, 6k + r is even, and when r = 3, 6k + r is divisible by 3. Therefore 
the prime p > 3 must be of one of the forms 6k + 1 or 6k + 5. Now 
p = 6k + 1 gives p + 2 = 6k + 3 which is composite. We conclude that 
p — 6k + 5 and 


p (p 2)-26k4 54 (6k + 7) = 12(k + 1). 
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COMMENT The observation that any prime greater than 3 must be of one of 
the forms 6k + 1 or 6k + 5 certainly leads to an elegant solution. But it may 
well be that you failed to make this observation. A simpler approach would be 
to start by noting that p must be of one of the forms 3k, 3k + 1 or 3k + 2. For 
p > 3 and p + 2 to both be prime, p = 3k + 2 is the only possibility, and this in 
turn requires k to be odd. Then 


2 


Alternatively, the observation that any prime greater than 3 must be of one of 
the forms 12k + 1, 12k + 5, 12k + 7 or 12k + 11 leads to an immediate 
solution. BN 


READ B from page 60, line 12 to page 62, line 9. 


Page 61, line —3 

The proof given in B that every arithmetic progression of positive integers 
contains infinitely many composite numbers may seem unnecessarily 
complicated. For every integer n, the term a 4- (na)b = a(1 + nb) is a composite 
term in the progression whenever a z 1. This simpler argument breaks down, 
however, for those progressions with initial term a — 1. A 


P + (p- 2) = (3k + 2) + (3k + 4) = 6(k + 1) nf) 


B page 64, Problems 3.3, number 13. 


(Hint: Assume that q,. q,...., q, are the only primes of the form 6k + 5 and 
obtain a contradiction by showing that 64, d;...q, — 1 has a prime divisor of 
the form 6k + 5. You will need an analogous result to the Lemma on B page 60.) 


B page 65, Problems 3.3, number 18, part (a). 
(Interpret ‘every pth term’ as meaning ‘every pth term from some point onwards.) 
Following the lines of the proof of Theorem 3-6, we assume that there are only 


finitely many primes of the form 6n 4- 5, and aim to derive a contradiction; call 
them q,, q,,....q,. Consider the positive integer 


N —64,4,...4, — 1 = 6(q, q2... q, — 1) + 5. 


Since N is of the form 6n + 5 it is not divisible by 2 or 3, and so (as in the 
solution to SAQ 13(b)) each prime divisor of N is either of form 6n + 1 or 
6n + 5. Now the identity 


(6n + 1) (6m + 1) = 6(6nm + n +m) +1 


shows that the product of two (and, consequently, any number of) integers of 
the form 6n + 1 is also of that same form. But N is not of the form 6n + 1 and 
so not all its prime divisors can be of that form. We conclude that there is at 
least one prime divisor p of N which is of the form 6n AD: 


Finally, p 4 q; for i= 1,2,...,s, for otherwise D|64, d5...q, — N, ie. pil. 


This contradicts the assumption that there are only finitely many primes 
d1: 25... qd, Of the form 6k + 5, and establishes an infinitude of such primes. W 


Following the hint, as gcd(b, p) = 1 there exist integers r and s such that 
pr + bs = 1 (by Theorem 2-4). For each integer k, positive or negative, let 
n, = kp — as. Observe that the (double) sequence of integers 


e A an p Mgs Aai 


is in arithmetic progression with common difference p. Let n, be the first positive 
member of this sequence. Then 


a+ nb, a+ nb, a 4 nj, ,b,... 
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gives every pth term of the original arithmetic progression (from some point 
onwards), and it remains to show that each of these is a multiple of p. Now for 
all k, 


a+ nb =a + (kp — as)b = a + kpb — asb 


a + kpb — a(1 — pr) = kpb + apr 
= p(kb + ar), 
completing the proof. i 


READ B from page 62, line 10 to the end of Section 3.3. 


Find the smallest positive integer n for which the function f(n) = n? 4- n + 17 
takes a composite value. 


f(n) =n? +n4+17=n(n4 1) +17, 


and so f(n) is composite when 17|n(n + 1). The first time this happens is when 
n= 16, giving f(16) = 177. 


Substituting n = 1,2,...,15 in f(n) produces the integers 19, 23, 29, 37, 47, 59, 
73, 89, 107, 127, 149, 173, 199, 227 and 257, all of which are prime (see Table 2): 
So the smallest positive integer n for which f(n) is composite is 16. BN 


The Fibonacci Sequence 


In the previous section we investigated occurrences of primes in arithmetic 
progressions and in sequences 


F0), SO) f G).... 


where f is a polynomial with integer coefficients. The next step along this road 
is to investigate occurrences of primes in recursive sequences, i.e. sequences 
defined by recurrence relations. We shall restrict ourselves to considering the 
best known of all such recursive sequences, the Fibonacci sequence: 


1, 15.25 3,5, 8, 13, 2L... 
which is defined by 
uy =U, =1; u,—u,-,--u,-2,nm 3. 


We shall discover in this section that the members of the Fibonacci sequence, 
the Fibonacci numbers, have fascinating divisibility properties, although it is still 
an open question as to whether there are infinitely many prime Fibonacci 
numbers. The section will not only serve as useful revision for ideas and results 
of Unit 1, but will also illustrate the usefulness of ‘congruence’, which is the 
theme of Unit 3. 


READ B Section 13.1 from page 286 as far as page 288, line — 10. 


The result just cited is not special to primes. For any positive integer n there are 
infinitely many Fibonacci numbers which are multiples of n and they occur 
equally spaced in the Fibonacci sequence. Although B comments that the 
general proof of this result is quite cumbersome, there is an elegant proof 
However, this requires some understanding of congruence and we shall therefore 
defer it until Unit 3. 
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In fact, for each particular value of n the result can be demonstrated without too 
much difficulty. (Unfortunately, each proof is different and there is no way of 
deriving from them one proof which works for all positive integers n.) Let us see 
how we would go about showing that every fourth term of the Fibonacci 
sequence is divisible by 3. 


We write down a new sequence obtained from the Fibonacci sequence by 
replacing each term by its remainder on division by 3: 


1,1,2:0,2,2.1,0, 1, 1, 20) 2; 913,0... 


By the Division Algorithm, the terms can only be 0, 1 and 2, where the 0’s 
correspond to the terms in the Fibonacci sequence which are divisible by 3. As 
you can see, every fourth term so far is a zero and what we need to prove is 
that this pattern of zeroes continues. The essence of the argument is that we can 
continue the new sequence without reference to the original Fibonacci sequence. 
As each Fibonacci number is the sum of the preceding two, it follows that each 
term of our new sequence is obtained by adding the two preceding remainders 
and then taking the remainders on division by 3. (We shall ask you to prove a 
general result of this type in SAQ 17.) For example, successive terms 2 and 1 in 
the remainder sequence correspond to Fibonacci numbers of the forms 3k + 2 
and 3/ + 1, and the next term is (3k + 2) + (31 + 1) = 3(k +1+ 1) +0, which 
has remainder 0 on division by 3. Our observation merely states that this 
remainder could have been obtained directly by adding the remainders 2 and 1 
and then taking the remainder of the resulting sum. 


The proof is completed by observing that the remainder sequence consists of the 
pattern 1, 1, 2, 0, 2, 2, 1, 0 recurring indefinitely, since once a pair of successive 
terms are a repeat of an earlier pair the sequence must recur, each term being 
the sum of the two preceding terms. So every fourth term in the remainder 
sequence is 0 and hence every fourth term in the Fibonacci sequence is divisible 
by 3. 


[The underlying techniques contained in the above proof are illustrative of a 
powerful theory which we shall develop in general in the next unit. For a given 
positive integer n, replacing each integer by its remainder on division by n leads 
to a new ‘arithmetic’ on the set of remainders {0, 1, 2,...,n — 1}. Results in this 
simpler model translate into important properties of the integers themselves. ] 


The integers a and b have remainders r, and r, when divided by the positive 
integer n. Show that a + b and rı + r5 have the same remainder when divided by n. 
(We used the special case of this result with n — 3 in the above argument.) 


Write out the sequences of remainders when the terms of the Fibonacci 
sequence are divided by (i)4, (ii) 5 and (iii) 6. What conclusions can you draw 
about the occurrences of the multiples of these integers in the Fibonacci sequence? 
Justify your conclusion for case (i). 
Leta=qn+ry, OzXr,«n 
andb=q.n+r,, O<r, <n. 
Now let r, be the remainder when rı +r, is divided by n, ie. 

rı +, =q3n +r}, Oxr,«nm 
Adding the first two equations we get 

a+b=qnt+qn+r, 4r, 
=(4 +4 +q3)n+r3, Oxr, <n. 


Hence, by uniqueness of remainder, r, is the remainder on dividing a + b by n. 
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COMMENT We have confined attention here to the important case where the 
integer n is positive. The result is, however, true for all non-zero integers. Bl 


(1) L1; 2,3,1,0, 1, 1,2. 3; 1, 0... 


By the result of SAQ 17, each term of the remainder sequence is obtained 
by adding the two preceding terms and taking the remainder of this sum on 
division by 4. Hence, on arriving at the successive pair 1, 0 the next two 
terms 1, 1 are a repeat of the initial pair and so thereafter the sequence 
recurs. The sequence therefore consists of the pattern 1, 1, 2, 3, 1, 0 repeated 
indefinitely, every sixth term being 0, and consequently every sixth term of 
the Fibonacci sequence (and no other) is a multiple of 4. 


(ii) 1, 1, 2, 3, 0, 3, 3, 1, 4, 0, 4, 4, 3, 2, 0, 2, 2, 4, 0 1, 1,... 


The sequence consisting of the first 20 integers above recurs indefinitely and 
thus every fifth term of the Fibonacci sequence is divisible by 5. 


(iii) 1,1.2,3,5,2,1,5,4,1, 5,0, 5, 54, 3, 1,4.5, 3.2.5, 1,.0,.1, 1, 


The sequence consisting of the first 24 integers above recurs, and thus every 
twelfth term of the Fibonacci sequence is divisible by 6. B 


READ B page 288, line —9 to page 290, line —11. 


(i) Page 289, line —9 
In order to ensure positive subscripts in identity (1) we require the condition 
m 2 2. This however has implications for the proof of Theorem 13-2 where 
the use of (1) in the induction step includes the case m — 1. Fortunately, 
Theorem 13-2 is clearly true when m = | since then u,, = 1. An alternative 
way around this difficulty is to define an extra term u, of the Fibonacci 
sequence, satisfying the recurrence relation u, = u, + ug, Le. ug = 0. This 
now admits the case m — 1 in (1), and Theorem 13-2 goes through 
unaltered. 


(ü) Page 290, line 12 


Note that Theorem 13-2 can be expressed more succinctly as: 


if rls then u]u. A 


Use identity (1) and the values of the first twelve Fibonacci numbers given on B 
page 287 to find u,,, u;4 and u,,. Confirm that u,,|u5; and ug|u,,. 


Putting m — n — 11 in (1) gives 


Uy, = Wyo Uyy + Uy Uy. = U1 (Uio + 55) (note that u; ,[u5;) 
= 89. (55 + 144) 
= 17711. 


Putting m = 12 and n = 11 in (1) gives 
Uz3 = Uy Uy, + Uiu 
= 89? + 144? = 28657. 


Finally, 
54 = Uz3 + Uz, = 46368 


and 
46368 = 2208 -21 = 2208 uş. W 
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READ B from page 290, line — 10 to the end of Section 13.1. 


Page 292, line 4 

Having arrived at u,.4¢m,n) = Um B concludes that m = gcd (m, n). If all the 
Fibonacci numbers are distinct this would be valid, but unfortunately 

u, = u, = 1. Hence the conclusion that m = gcd(m, n) can only be drawn 
provided m > 2. However, Theorem 13-3 is trivially true in the cases m — 1 and 
m — 2, so all is well. 


This explains why the Corollary requires the hypothesis that m > 2. Since 
u, = 1 we have u,|u, for all positive integers n, but it is certainly not the case 
that 2|n for all positive integers n. A 


The Corollary to Theorem 13-3 tells us that for n 4 the Fibonacci number Up 
is composite whenever n is composite. This follows from the fact that if n>4is 
composite then it may be written as a product nın, where 2 <n, « n, and so 
Up, lup Where 1 < up, < up. 


We can also use the Theorem and Corollary to prove that the multiples of a 
given integer n which appear in the Fibonacci sequence do so with equal 
spacing. For, suppose that u,, is the first Fibonacci number which is a multiple 
of n. Since u,|u,,, for k = 1, 2,... it follows that n|u,,. Hence every mth term in 
the Fibonacci sequence is a multiple of n. On the other hand, these are the only 
multiples of n, for if nju, then, being a common divisor of u,, and u, n|gcd (u, u). 
That is, from Theorem 13-3, |, caes. Now as m was chosen to be the 

least positive integer with the property nlu,, it follows that m < gcd(m, 1). But 
certainly god (m, I) X m and so gcd(m,I) = m. This implies that l is a positive 
multiple of m so that u is one of the u,n: 


It might look as if this completes the proof of the result which B cites as 
cumbersome. Büt notice that all we have shown is that if a multiple of n 
appears in the Fibonacci sequence then infinitely many multiples of n occur 
with equal spacing. The proof that one multiple of n does occur will have to 
wait until the next unit. 

B page 293, Problems 13.1, number 8. 

Use the Corollary to Theorem 13-3 to 

(i) find where in the Fibonacci sequence multiples of 13 occur; 

(ii) show that 4|u, if and only if 8|u,. 

B page 293, Problems 13.1, number 10. 

cd(u,4, U36) = Ugcaira, 36) = U12, by Theorem 13-3. Therefore the common 


divisors of u;4 and u36 are the divisors of u12. By the Corollary, the Fibonacci 
numbers which divide u,, are 


u, =U, = 1, uy = 2, u4 = 3, ug = 8 and u,-—144 Hg 


(i) Note firstly that u, = 13. Hence, by the Corollary, 13|u, if and only if 7|n. 
This means that every seventh term is a multiple of 13. 


(i) As us = 8 we know that 8|u, if and only if 6|n, so the multiples of 8 in the 
Fibonacci sequence occur as every sixth term. However, we have already 
seen that the multiples of 4 also occur every sixth term. So every Fibonacci 
number which is a multiple of 4 is also a multiple of 8. E 


By Theorem 13-2, u, |u,,, and u,|u,,,.F urthermore, by Theorem 13-3, 
BCA (Um, Un) = Ugoden, = Uy = 1. So by Corollary 2 on B page 28 we have 
Unt]. 
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2.5 Epilogue 


In the unit we have seen plenty of evidence for the unpredictability of 
occurrence of the prime numbers. On the one hand, there appear to be infinitely 
many pairs of successive odd integers both of which are prime (twin primes), 
whilst at the same time there exist arbitrarily long sequences of consecutive 
composite numbers. But despite this local irregularity of their distribution, when 
we look at the primes overall, their global distribution can be described with 
rather more success. 


Towards the end of the eighteenth century, Gauss and Legendre observed 
independently that the sequence of primes ‘thins out’. To be more precise, if for 
each positive real number x we denote by n(x) the number of primes not 
exceeding x (so that, for example, z(6) = 3 and z(181) = 7), then z(x) is an 
increasing function whose rate of growth decreases as x gets large. This suggests 
that we might look with interest at the function z(x)/x which measures the 
‘density’ of the primes in the interval (0, x]. It turns out that z(x)/x bears a very 
strong resemblance to the function 1/log, x. 


zex) 


is in fact discontinuous at each prime. 
With the large scale we have taken 
on the x-axis these discontinuities 
do not show up 
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On analysing extensive tables of primes, Gauss and Legendre observed that as x 
gets larger and larger the values of x(x) seem to behave more and more like 
those of x/log, x. This led them to formulate a theorem which has become 
known as the Prime Number Theorem: 


à mx) |. 
i lin on z} " 


The effort they expended in formulating this result must have been really 
considerable. There were no efficient calculating devices at that time, and it is 
known that Gauss actually compiled by hand a table of all primes up to 
3,000,000 and investigated the number of primes occurring in each group of 
1000 consecutive integers. 


But Gauss and Legendre were unable to prove the Prime Number Theorem. It 
was through the collective genius and hard work of several mathematicians in 
the nineteenth century—notably Dirichlet, Chebyshev, Riemann, Hadamard and 
de la Vallée Poussin—that the theorem was finally proved in 1896 by 
Hadamard, and, independently, by de la Vallée Poussin. 


The proofs of the Prime Number Theorem make extensive use of complex 
function theory, utilizing properties of the so-called Riemann Zeta function. But 
in 1949, a purely arithmetic proof, not calling on techniques of complex 
analysis, was discovered by the Norwegian mathematician Selberg. The proof. 
however, is extremely complicated and beyond the scope of this, or any other 
first course in Number Theory. 
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2.6 


En route to proving the Prime Number Theorem by methods of complex 
analysis, many other results emerge which are interesting in their own right. 
One merits mention here. Many beginning students of mathematics are 
surprised on first learning that the harmonic series 

Lo dada 


1 1 
Er 1'3*3*43^ id 


is divergent (the sum is unbounded). If so, it must come as a shock to find that 
if all the terms are discarded except those where the denominator is prime then 
the resulting series " 


1 1. 1 


1 
qw 2 7 M. 


iol. 
ba eaa 


is still divergent. 


For more about the history of the Prime Number Theorem, some numerical 
evidence for its validity, and an insight into the technical content of its proof, 
you might like to read the first appendix of B, pages 344—350. 


Summary 


As B remarks, the notion of prime number is essential to every aspect of 
number theory. In the first section of this unit we established some of the basic 
results about prime numbers including the Fundamental Theorem of Arithmetic. 
and the prime factorization of an integer. 


In Section 2.2 we looked at the problem of investigating the primality of a given 
integer and demonstrated a reasonably practical method for listing all primes 
less than some-given integer N. Showing that there exist infinitely many primes, 
we then obtained crude estimates of the size of the nth prime. 


Section 2.3 gave some results concerning the distribution of the infinite set of 
primes within the set of positive integers. Included was the result that any non- 
constant polynomial with integer coefficients, and of an integer variable, takes 
on infinitely many composite values. On the other hand, Dirichlet's Theorem 
tells us that certain linear polynomials take on infinitely many prime values. 
Finally, a glance at Goldbach's Conjecture was used to illustrate the many 
questions which remain unanswered in this area of the subject. 


Section 2.4 introduced the Fibonacci sequence and investigated divisibility 
properties of the Fibonacci numbers. 


The main definitions and results encountered in this unit are listed in the 
corresponding entry in the Handbook. 
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3.0 


3.1 


A we yee 


Introduction 


There are many occasions when properties are shared by two integers provided 
they differ by a multiple of some fixed integer n > 2. In the case n = 2, two 
integers differ by a multiple of 2 when they have the same parity (even or odd) 
and, for example, (— 1)* = (— 1) precisely when a and b have the same parity. 
Taking n — 4, it is easily derived from the Division Algorithm that two integers 
have the same remainder (0, 1, 2 or 3) when divided by 4 if and only if their 
difference is a multiple of 4. Again it is true that gcd(a, n) — gcd(b, n) whenever 
a — b is a multiple of n (see SAQ 4). 


It is convenient to have a name and notation to describe this situation, and 
such is developed in the first section of this unit. Two integers a and b are said 
to be congruent modulo n provided a — b is a multiple of n. This unit and the 
next are wholly concerned with congruence, a concept which will occur 
frequently throughout the remainder of the course. 


Our first substantial application of congruence took place in the final section of 
Unit 2, although the terminology was not then at our disposal. Choosing a 
positive integer n, we replaced each Fibonacci number by the smallest non- 
negative integer to which it is congruent modulo n (its remainder on division by 
n), producing a new sequence which we referred to as the remainder sequence. 
Study of this simpler ‘congruent’ sequence led us to properties of the Fibonacci 
sequence itself. 


The unit comprises Chapter 4 of B. Section 3.1 investigates basic properties of 
congruence, involving essentially re-expression of certain divisibility properties of 
integers. The Division Algorithm plays a crucial role in this respect. Section 3.2 
turns to applications of congruence and, in particular, establishes tests for 
divisibility by certain integers. Section 3.3 is the core of the unit, looking at the 
problem of solving linear congruences (linear equations in which equality is 
replaced by congruence). This section culminates in a classical result concerning 
solutions of a system of simultaneous linear congruences, the so-called Chinese 
Remainder Theorem. As promised in Unit 2, Section 3.4 returns to our study of 
the Fibonacci sequence, illustrating the use of congruence in extracting 
properties of the sequence. 


This unit again puts an emphasis on problem solving. The main objective of the 
unit is to enable you to solve a variety of congruence problems, in addition to 
appreciating how congruence problems arise naturally in mathematics. To this 
end, the unit contains a good number of worked examples and SAQs, and we 
round off with the short tape-based Section 3.5, on further problem solving. 


Basic Properties of Congruence 


As the title suggests, this section considers the most fundamental properties of 
congruence. The results obtained on the laws governing addition, multiplication 
and cancellation for congruences are crucial for a full understanding of the rest 
of the course. Fortunately, the proofs are simple and you should have no 
difficulty in assimilating the results immediately. 


But first we learn something about the eminent mathematician Gauss, who laid 
the foundations of modern number theory and indeed first introduced the 
concept of congruence. 


| READ B Chapter 4, page 68, as far as page 72, line 13] 
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Page 71. line 14 

Although B's reference to the set of least positive residues modulo n is standard 
terminology, strictly speaking they should be called the set of least non-negative 
residues, since 0 is not positive. A 

B page 75, Problems 4.2, number 1. 

B page 76, Problems 4.2, number 7. 

(a) a = b(mod n) means that n|a — b. Since mjn it follows (by Theorem 2-2(4), B 

page 24) that m|a — b. That is, a = b(mod m). 


(b) As a = b(mod n) we have a — b = kn, for some integer k. Therefore, 
multiplying through by c, ca — cb — k(cn), which shows that cn|ca — cb. 
That is, ca = ch(mod cn). 


[Note that the condition c > 0 is needed because the modulus cn must be a 
positive integer. ] 


(c) From a — b — kn, division by d gives 


a b 
d d 
the latter being positive, this amounts to - 


n 
(mod) E 
d 
(a) As we are given n residues modulo n it remains to prove only that they are 
incongruent modulo n. Suppose therefore that 
c + ia = c + ja (mod n), where 0 € i, j < n. 


This means that n divides the difference (c + ia) — (c + ja) = (i — j)a. But 
from n|(i — j)a and gcd(a, n) = 1, Euclid's Lemma gives us n|(i — j) which in 
the language of congruence says i = j (mod n). The condition 0 € i, j <n 
now implies that i — j. 


(b) Any set of n consecutive integers is of the form 
c,c+1,c4+2,...,c+(n—1) 


and so all we need to do is observe that gcd(1,n) = 1 and invoke part (a) 
with a = 1 to obtain the required result. 


(c) By part (b), any set of n consecutive integers forms a complete set of 
residues modulo n, and so one of these integers must be congruent to 0 
modulo n, i.e. it is divisible by n. As one of the integers is divisible by n so 
too is the product. W 


Let us glance again at some of the steps in our solution to SAQ 2(a). From 
the initial supposition that 
c + ia = c + ja (mod n), 
the divisibility implications led us to n|(i — j)a, or, what is the same thing, 
ia = ja (mod n). 
So to reach this stage we have effectively subtracted c (ie. added —c) to the two 
sides of the congruence. Next we cancelled a to reach 
i= j (mod n) 
(this step being justified by Euclid's Lemma). 
Notice that these are precisely the steps we would take to deduce i — j if the 
initial congruence were replaced by ordinary equality. Indeed, there are certain 
similarities between the way we can manipulate congruences and the handling 


of corresponding equalities, which are listed in the next result, Theorem 4-2. The 
properties in the theorem are proved by falling back on the underlying 
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divisibility implications, but thereafter we shall learn to manipulate congruences 
by using these properties without resorting to divisibility. 


READ B from page 72, line 18 to page 74, line 5. 


The first three parts of Theorem 4-2 establish that, for each positive integer n, 
congruence modulo n is an equivalence relation. Part (1) is the reflexive 
property, (2) the symmetric property and (3) the transitive property. For each n 
then, we have a partition of the integers into disjoint classes, two integers being 
in the same class if and only if they are congruent modulo n. These classes are 
called the residue (or congruence) classes modulo n. ? 


In the case n — 4 the residue classes are 
{..., —8, —40,4, 8...) 
(77, —3,1,5, 9,...] 
£...,—6, —2,2,6,10,...) 
{..., -5, —1,3,7,11,...] 


corresponding to the set of least positive residues {0, 1, 2, 3}. Selecting one 
number from each residue class, e.g. ( —8, 5, —2, 3}, will always produce a 
complete set of residues, as we have the correct number of residues and no two 
of them can be congruent since they belong to different residue classes. Another 
choice of residues which often proves useful is the set {— 1,0, 1,2}, where 

from each class we have selected a residue of least absolute size. For general 
n> 1, the set of least absolute residues consists of the n integers which satisfy 
the inequalities —n/2 < x € n/2. For example, n = 7 gives the set 

1-3, -2, —1, 0, 1, 2, 3}. 


J 


Part (4) of Theorem 4-2 is of great importance. In essence it says that the 
residue class to which a sum a + c or a product ac belongs does not depend so 
much on the individual numbers a and c but rather on the residue classes to 
which they belong. Denote by [r] the residue class to which r belongs. This will 
mean that residue classes are denoted in many different ways; for instance, 
taking n = 5, [C1] = [4] = [19] since —1 = 4 = 19 (mod 5). However, 
corresponding to the set of least positive residues, the set 


Z, = (I0), [1]... [n- 1] 
is the set of all residue classes modulo n. Reinterpreting in terms of residue 
classes, (4) tells us that if [a] = [b] and [c] = [d], then [a + c] = [b +d] and 
[ac] — [bd]. This implies that the operations given by 
[a] + [b] = [a+ b] 
and [a] x [b] = [ab] 


are well defined (unambiguous) on Z,. For example, when n = 4 we obtain the 
ollowing tables 
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where, for instance, [3] x [2] = [2] is obtained from the definition 
[3] x [2] = [3:2]=[6], and the fact that [6] = [2] since 6 = 2 (mod 4). 


In practice, we can reduce the amount of arithmetic in congruence problems by 
working with the numbers in a complete set of residues alone (the most frequent 
choice being the set of least positive residues although occasionally it is more 
appropriate to work with the least absolute residues). After each sum or product 
we replace the answer by the chosen residue to which it is congruent. That is, 
we are really using the residue class arithmetic defined above but avoiding much 
of the formalism. Here are a couple of illustrations. 


B page 76, Problems 4.2, number 6. 


To verify that 0, 1, 2, 27, 23,...,2° is a complete set of residues modulo 11 we 
show that, in some order, these integers are congruent modulo 11 to the 
members of the set of least positive residues. 


22-4; 2548; 24=16=5; 25-2.2* 22.5210; 2622.10 9; 
222.927; 2222.723; 29=2-3=6. 
Together with 0, 1 and 2 these give 10, 1, 2, 3, 4,..., 10}. 


That 0, 1?, 22,..., 10? is not a complete set of residues modulo 11 follows since 
these 11 integers are not incongruent; for instance, 1? = 10? (mod 11). Indeed, 

since 11 — r = —r (mod 11), Theorem 4-2(6) gives (11 — r} = (—r)? = r°, and 
taking r — 1,2,3,4 and 5 identifies five congruent pairs amongst this set. 


Show that there do not exist integers x and y for which x? + y? = 999, 


At first glance it is difficult to see what this question has to do with congruence. 
In fact, it is a particular instance of a general result concerning the possible 
values that the sum of two squares can take when we approach the problem 
working with congruence modulo 4. 


Any integer x is congruent to 0, 1, 2 or 3 modulo 4. Theorem 4-2(6) tells us that 
x? is congruent to one of 0?, 17, 2? = 0 or 3? = 1, modulo 4. That is, 

x? = 0 or 1 (mod 4). Similarly, for any integer y, y? = 0 or 1 (mod 4). 

Theorem 4-2(4) now gives x? + y? = 0, 1 or 2 (mod 4) (the possible values 
obtained adding 0 or 1 to 0 or 1). 


We conclude that no integer which is congruent to 3 modulo 4 can be expressed 
as the sum of two squares. But 999 = 3 (mod 4), and so x? + y? = 999 has no 
integer solutions. 


There is one further result of equal importance to those of Theorem 4-2 which 
we include here for future reference. 
If a = b (mod m) and a = b (mod n) when ged(m, n) = 1, then a = b (mod mn). 


In fact this result follows immediately from Corollary 2 on B page 28. From the 
definition of congruence we are given that m|a — b and n|a — b. Coupled with 
gcd (m, n) = 1, the Corollary gives mn|a — b. That is, a = b (mod mn). B 


Before you start work on the self-assessment questions, one word of warning. 
Don't fall into the trap of thinking that because x = y (mod n) it will follow that 
a = a’ (mod n). A simple counterexample is provided by the fact that 

1 = 4 (mod 3) but 2! ¥ 2* (mod 3). 

B page 76, Problems 4.2, number 4. 

B page 75, Problems 4.2, number 3. 


Show that if a = b (mod m) and a = b (mod n) then a = b (mod I), where 
l = lem(m, n). Derive from this another proof of the Theorem preceding SAQ 3. 
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B page 76, Problems 4.2, number 10, parts (a), (b), (c) and (d). 


(a) 2° = 8 = 1 (mod7). 
Therefore 2^9 = (23)16 = 116 = 1 (mod 7) 
andso 2° = 2?.2*8 =4.1 = 4 (mod7). 
The remainder on dividing 25° by 7 is 4. 


41 = 6 (mod 7) (or, if you prefer, 41 = —1 (mod 7)). So 41? = 1 (mod 7), and 
41°° = 41- (412)? = 6.1?? = 6 (mod 7). 


The remainder on dividing 41° by 7 is 6. 


(b) 1 =5 3 9=---= 97(mod4), 
2=6=10=---= 98(mod4), 
3=7=11=---= 99(mod 4), 
4=8 =12=---=100(mod4). 
Therefore 


1? + 2° 4+ 35 +- + 1005 = 25. (15 + 25 + 35+ 45) (mod 4) 
= 25-(1° + 25 + (—1)5 + 0) (mod 4) 
= 25-(1+0+ —1+40)=0(mod 4). 
The remainder on dividing the sum by 4is0. BN 


Since a = b (mod n) we have a — b = kn, for some integer k. Let d, = gcd(a, n) 
and d, = gcd(b, n). Since d,|a and d,|n it follows that d,|a — kn. That is, d,|b. 
Therefore d, is a common divisor of b and n, and so d, < d;. 


On the other hand, d,|b and d,|n gives d;|b + kn. That is, d;|a and so d, is a 
common divisor of a and n, giving d; < dı. Hence d; = d). BI 


Let d = gcd(m, n), so that | = lcm(m,n) = T, To show that a (mod l) we 


=b 
; ; : mn 
must show that there exists an integer t such that a — b = t ul 


Given that a = b (mod m) and a = b (mod n) we know that there exist integers r 
and s such that 


a—b=rm=sn. 


From rm = sn we obtain r (i S i) where 5 and 5 are integers with 
mn r Ia 5 
gcd qul 1. It follows from Euclid's Lemma that J divides r, and putting 
n|. ; : mn : 
r= (i in the above equation yields a — b = t I, as required. 


When gcd (m, n) = 1, lem (m, n) = mn and this result reduces to our earlier 
theorem. W 


We attempted several problems of this kind in Unit 1, where we used the 
Division Algorithm. This time we are going to attack them using ideas of 
congruence. But notice that as the congruence properties all arise from 
divisibility conditions, the arguments are going to be essentially the same as 
those of Unit 1. However, we now have more succinct terminology to use and, 
hopefully, some improved techniques. 


(a) Working with the set of least positive residues, any odd integer is congruent 
to one of 1, 3, 5 or 7 modulo 8. The result follows since 1? 2322 522 7? =1 
= 1 (mod 8). 
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(b) Working with the set of least absolute residues modulo 9, and investigating 
possible values of a°: 
0-0 p-1 2-8 1, 32-2720, #=64=1, 
(4? = -4# = —1, (-3? =-3?=0, (-2p--2cz1 (-1P--L 


So, for any integer a, a> = 0, 1 or — 1 (mod9). 


(c) Confirmation that, working modulo 6, 
(-2p2-2, (-1P 2-1 020, P521 2522 and3??23 


establishes a? = a (mod 6), for all integers a. 


(d) To show that a? = 1 (mod 24) it suffices to show that a? = 1 (mod 3) and 
a’ = | (mod 8), and appeal to our earlier theorem. Since we are only 
considering odd integers a, that a? = 1 (mod 8) has been shown in part (a). 


Since 3/a, a= 1 or 2 (mod 3). So a? = 1? or 2? (mod 3). That is, 
a? = 1 (mod 3), and the result follows. Bl 


READB from page 74, line 6 to the end of Section 4.2. 


The result of Theorem 4-3 should convince you that the utmost care is needed 
when you are tempted to cancel a common factor from both sides of a congruence. In 
fact, the theorem states that in doing so you in general produce a congruence 
with a different modulus. The two Corollaries describe situations where the 
modulus remains unchanged. The second Corollary, stated in terms of the 
arithmetic of Z,, where p is a prime, bears a striking resemblance to the 
cancellation law in the integers, since it says that if [c] [a] = [c] [b] where 

[c] # [0], then [a] = [b]. 


Use an appropriate cancellation to simplify each of the following congruences. 
(a) 6x = 18 (mod 30) 

(b) 20x = 30 (mod 45) 

(c) 52x = 39 (mod 67) 

B page 76, Problems 4.2, number 13. 


(a) 6x = 18 (mod30) => x =3 (mod5), since ged (6, 30) = 6. 


(b) 20x = 30 (mod 45) => 2x =3 (mod9). Since we are cancelling a factor 
10 and gcd(10,45) = 5 we must divide the modulus by 5. 


(c) 52x = 39 (mod 67) => 4x =3 (mod 67), since ged(13,67)=1. B 


The result in question here is a slight variation on Corollary 1. It says that we 
can cancel congruent numbers on either side of a congruence, provided they are 
relatively prime to the modulus. 


Given ab = cd (mod n) and b = d (mod n), we have integers r and s such that 

ab — cd = rn and b — d = sn. Using the second of these equations to eliminate d 
from the first produces ab — c(b — sn) — rn. That is, (a — c)b — (r — sc)n. From 

the fact that gcd(b, n) = 1, Euclid's Lemma now gives nla — c, which amounts to 
the required congruence. W 
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M335 32 
Special Divisibility Tests 


Given any integer (written in our usual decimal notation) we can tell at a glance 
whether it is divisible by 2 or 5 without carrying out the division. To test 
divisibility by 5, for example, we merely look to see whether the final digit is 0 
or 5. The section introduces tests for divisibility by 3, 4, 8, 9 and 11 and shows 
how these tests arise out of congruence considerations. 


l READ B Section 4.3, page 77, as far as page 79, line 13. | 


Page 77, line —4 
A somewhat shorter way of presenting the uniqueness argument is to assume that 
the smallest value of i for which a; # c; is k, say, so that 


amb” + ay b" E + + Hab = eb" + Cy 71 H H b, 
Cancellation of the common factor b* now gives 

(a, b" 77! E gg BA ap = (eb 3 eso C, 1)D + 6 
and since 0 X a,, c, « b, the uniqueness of remainder guaranteed by the 


Division Algorithm gives a, — c,—a contradiction. A 


Determine the base b place value notation for 170 where (a)b=2, (b)b— 7, 
(c)b —8 and (d)b — 11. 


(a) Following the application of the Division Algorithm on B page 77, 


170 = 85-20 
85—42.241 
42 -21.24-0 
21 210.241 
10= 5.240 
5-2 2.241 
2-2 1.240 
12 0.241 


So 170 — (10101010), 
(b) 170 = (332),. 

(c) 170 = (252),. 

(d) 170 = (145),,. M 


READ B from page 79, line 14 to the foot of page 79. 


Theorem 4-4 and its Corollary have numerous applications in finding (or more 
often establishing non-existence of !) integer solutions of polynomial equations. 
Here is an example to illustrate the general technique. 
Show that the polynomial equation 

P(x) = 8x? — 19x? -9x 6-0 


has no integral solutions. 
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The trick for this particular equation is to consider the possible residues of P(x) 
modulo 5. By the theorem, it suffices to let x take all values in a complete set of 
residues modulo 5, say (0, 1, 2, 3, 4}, for if x is any integer than it is congruent 
to one of these five residues, and consequently P(x) is congruent modulo 5 to 
one of P(0), P(1), P(2), P(3) or P(4). To determine these values we can first 
simplify the polynomial P(x) by reducing its coefficients modulo 5: 


P(x) = 330 + x? + x + 1 (mod 5). 
Then 


AE T 2 3 4 
P(x)(mod5) | 1 1 1 4 3 


shows that the congruence P(x) = 0 (mod 5) has no solutions. It now follows 
that P(x) = 0 cannot have any integral solutions, because if x is an integer 
such that P(x») = 0, then P(x) = 0 (mod n) for every positive integer n, 
contradicting what we have shown to be the case for n — 5. 


Note that the above argument would have failed if instead we had worked 

modulo 2, modulo 3 or modulo 4, for we would have found that 

P(0) = 0 (mod 2), P(0) = 0 (mod 3) and P(2) = 0 (mod 4), despite P(x) = 0 
having no integral solution. In other words, no solution to a corresponding 
congruence guarantees no solution to the original equation, but a solution 

to the congruence does not settle the equation question either way. 


Show that the following equations have no integral solutions. 

(a) x* - x? cx -x-120 

(b) 3x* — 4x3 + 13x? — 5x + 4 = 0. 

For both parts we adopt the method of Example 3. To show that the equation 


P(x) = 0 has no integral solutions it is sufficient to find a positive integer n such 
that the corresponding congruence P(x) = 0 (mod n) has no solutions. 


(a) Let P(x) = x* + x? + x? + x +1 and consider the congruence 
P(x) = 0 (mod 2). Taking {0,1} as complete set of residues modulo 2, 
P(0) = 1 (mod 2) and P(1) = 1 (mod 2) confirms that the congruence 
has no solutions. 


(b) Let P(x) = 3x* — 4x3 + 13x? — 5x + 4. Since P(0) = 4, we see that the 
congruence P(x) = 0 (mod 2) has a solution, and so working modulo 2 will 
not help us. Try modulo 3. First simplifying the coefficients modulo 3, we 
hope to find that the congruence 


P(x) = 2x? + x? + x + 1 = 0 (mod 3) 


has no solutions. Substituting x = 0, 1 and 2 we obtain P(0) = 1 (mod 3), 
P(1) = 2 (mod 3) and P(2) = 2 (mod 3), confirming that the congruence 
indeed has no solutions. W 


READ B from the top of page 80 to the end of Section 4.3. 


B page 81, Problems 4.3, number 5. 


B page 81, Problems 4.3, number 6, with the additional part: 
(e) Find a test for divisibility by 8. 


B page 81, Problems 4.3, number 10, part (a). 
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SOLUTION (a) The 9-test won't help here as it fails to distinguish between the digits 0 and 
TO SAQ 11 9. Using the 11-test: 
5281727—148—24 5= 17 = 6 (mod 11), 
3212146 =6—4+1-—2+1-—2+3=3 (mod11), 
and so 52817 -3212146 = 6-3 = 7 (mod 11). 


Now 169655x15282 =2—8+2-—5+1-—x+5-5+6-9+6-1 
= 5 — x (mod 11). 


Hence 5 — x = 7 (mod 11) (and 0 < x < 9) gives x = 9. 
(b) [3(523 + x)]? = 9(523 + x)? = 0 (mod 9). 
Using the 9-test: 
2x99561 2 2-- x +9+9+5+6+1=32+x= 5 + x (mod9) 


So we require 5 + x = 0 (mod 9), which gives x = 4 as the only 
possibility. Bi 
SOLUTION Throughout this solution we let N = a,,10" + a, ..,10" ^! + --- +.a,10 + ay 
TO SAQ 12 where 0 X a; < 9 for each i. 
(a) Since 10 = 0 (mod 2), N = a, (mod 2) and N is divisible by 2 (ie. is 
congruent to 0 modulo 2) if and only if aj is divisible by 2. This occurs 
when a, = 0, 2, 4, 6 or 8. 
(b) Since 10 = 1 (mod 3), 10* = 1 (mod 3) for all k 2 1 and so 
N =4,, dy, +... d, + do (mod 3). The result follows. 


(c) As 10* = 0 (mod 4) for all k = 2 we have that N = a,10 + ay (mod 4) which 
gives the required result. 


(d) As 10 = 0 (mod 5), N = a, (mod 5) showing that N is divisible by 5 precisely 
when a, — 0 or 5. 


(e) Extending the idea of part (c) gives us a test for divisibility by 8. Since 
8|1000 it follows that 10* = 0 (mod 8) for all k > 3. Therefore 
N = a410? + a,10 + ay (mod 8). This means that N is divisible by 8 if 
and only if the integer formed by taking the final three digits of N is 
itself divisible by 8. IN 


SOLUTION Since N and M are made up of the same digits (albeit in different orders), the 
TO SAQ 13 sum of the digits in N is equal to the sum of the digits in M. By the 9-test, 
N = M (mod 9) and so 9N — M. B 


3.3 Linear Congruences 


In the previous section we introduced the idea of solving congruences 
P(x) = 0 (mod n), where P is a polynomial with integer coefficients. In the 
particular case when P has degree 1 we have a linear congruence. The next 
reading passage is concerned with the solutions of linear congruences. 


As we shall see, there is a very strong relationship between linear congruences 
and linear Diophantine equations; indeed, our work on the latter (Section 1.7 of 
Unit 1) leads us to a result on the solvability (and number of solutions) of the 
general linear congruence. In practice, however, we shall acquire techniques for 
solving linear congruences without resorting to the method of Unit 1 (where we 
employed the Euclidean Algorithm). 
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(iii) 
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READ B Section 44 from page 82 as far as page 85, line — 6. 


Page 83, line 12 

The proof of Theorem 4-7 may, at first glance, look long and involved, but 
in reality there is very little to it. Observing that the solutions of 

ax = b (mod n) are precisely the x-values in solutions of the linear 
Diophantine equation ax — ny — b, Theorem 2-9 (on B page 40) has 
already done the bulk of the work. That theorem tells us that the linear 
congruence has solutions if and only if d divides b, where d = gcd (a, n) 


s P S nt 
and if x, is one solution then the others are the values of x, — T as t 


runs through all integers. All that remains here is to determine how many 
of these solutions are different, that is, how many are incongruent 
modulo n. The claim made is that if x = x, is one solution then 


n 


d 


: n 2n 
Xo; Xo typ Xo Serr 
is a ‘complete’ set of d incongruent solutions modulo n. To justify this we 
must show that (i) no two members of this set are congruent modulo n, 
n 
d 
modulo n to one of the members of this set. (i) follows immediately when 
we observe that the maximum difference between members of the 


, Xo + (d — 1) 


and (ii) that any solution, ie. xy — =t for any integer t, is congruent 


set is (d — 15 which is less than n, so the difference between distinct 


members of the set cannot possibly be divisible by n. As for (ii), let t be 
any integer and note that t = r (mod d) for some r belonging to 

(0, 1, 2,..., d — 1}, which is a complete set of residues modulo d. Then by 
SAQ 1, ti = " (mod n) and, by Theorem 4-2(4), x, + : Zr 


t = xo + st (mod n) 


as required. 


Page 85, line 19 

If you do not follow the sentence in brackets explaining the simplification, 
do not worry about it at this stage. We shall expand this point and look 
for other such simplifications in SAQ 15. 


Page 85, line —6 

In Example 4-7, the linear congruence 9x = 21 (mod 30) is said by B to be 
‘equivalent’ to the linear congruence 3x = 7 (mod 10). This is justified by 
the fact that the two congruences have the same solution set: an integer x 
is a solution of one of the congruences if and only if it is also a solution of 
the other. And yet there appears to be a distinction when, having found 

x = 9 (mod 10) to be the unique solution of 3x = 7 (mod 10), we find that 
the congruence 9x = 21 (mod 30) has three incongruent solutions modulo 
30. But these two solutions sets are the same; the set of integers which are 
congruent modulo 30 to either 9, 19 or 29 is precisely the residue class of 9 
modulo 10. The only difference is in the choice of expressing the solution 
in terms of modulo 10 or 30. A 


A natural question here is to ask why we should want to express the solution in 
terms of modulo 30 when we have a more compact solution in terms of modulo 
10. Indeed, given a linear congruence ax = b (mod n) in which d|b, where 

d — gcd(a, n), why do we not always start by dividing through by common 
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s , b 
factor d and look instead for the unique solution of m =7 


the real result we need is not Theorem 4-7 but rather its simpler Corollary? The 
answer lies in the nature of the problem in which the linear congruence to be 
solved has arisen. If we are asked to solve, in isolation, the linear congruence 
ax =b (mod n), then the best approach is to remove all common factors of a, b 
and n and aim for the unique solution (if there is one) in terms of the reduced 
modulus. In practice, however, if the linear congruence arises at some stage of a 
problem, frequently it will be necessary to solve the congruence in terms of the 
original modulus n. To do this we can either find one solution and then appeal 
to Theorem 4-7 (as B did in Example 4-6), or we can remove common factors, 
find the unique solution in terms of the reduced modulus, and then re-express 
the solution in terms of the original modulus (as B did in Example 4-7). 


n 
mod , so that 


SAQ 14 Use Theorem 4-7 to solve the linear congruences 
(a) 8x = 34 (mod 60), 
(b) 4x = 13 (mod 47), 
(c) 6x = 15 (mod 33), 
(d) 95x + 20 = 0 (mod 55). 


SAQ IS (a) If gcd(a,n) = 1, prove that 10. a, 2a,..., (n — 1)a} isa complete set of 
residues modulo n. Deduce that there exists an integer c such that 
ca = 1 (mod n). 


(b) If gcd (c, n) = 1, prove that the congruences ax = b (mod n) and 
cax = cb (mod n) are equivalent. Hence, 


(c) solve the following linear congruences: 
(1) 5x = 11 (mod 19), 
(ii) 3x = 19 (mod 29), 
(iii) 17x =6 (mod 49), 
(iv) 11x =4 (mod 75). 


(d) What is wrong with the following argument? 
To solve 7x = 1 (mod 40). 
Multiplying though by 6 gives 
42x = 6 (mod 40). 
That is, 2x = 6 (mod 40), 
or x = 3 (mod 40). 
[Note that 7-3 x 1 (mod 40), so something has gone wrong!] 
SAQ 16 B page 89, Problems 4.4, number 2. 


SOLUTION (a) Since gcd(8,60) = 4 and 4/34 this congruence has no solutions. 


TO'SAQA4 (b) Since gcd(4, 47) = 1 this congruence has a unique solution modulo 47. 


Observing that 13 = 60 (mod 47), the congruence can be written as 
4x = 60 (mod 47) from which we ‘spot’ the solution x = 15 (mod 47). 


COMMENT If you failed to spot this solution you could have found it by 
resorting to the method of Section 1.7, using the Euclidean Algorithm to solve 
the linear Diophantine equation 4x + 47y = 13. As a general rule, however, you 
should be able to avoid falling back on this method. SAQ 15 outlines a much 
improved method, and in Section 3.5 we discuss a systematic approach to 
solving linear congruences where the overall strategy is to keep simplifying the 
congruence until such time as a solution can be spotted. 
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(c) gcd(6,33) = 3 and 3|15 so the congruence has three solutions modulo 33. As 
15 = 48 (mod 33), we spot that x = 8 (mod 33) is one solution. By Theorem 


4-7 the others are x = 8 + - z19 (mod 33) and x = 8 + L2 =30 (mod 33). 


(d) Since 95 = 40 (mod 55) and —20 = 35 (mod 55) we first rewrite the 
congruence as 40x = 35 (mod 55). Now gcd(40, 55) = 5 and 5|35 so the 
congruence has five solutions modulo 55. One of them is seen to be 
x = 5 (mod 55) and so by Theorem 4-7 the complete solution is 
x = 5, 16, 27, 38, 49 (mod 55). M 


SOLUTION (a) As the given set contains n residues we need show only that no two of them 


TO SAQ 15 are congruent modulo n. This follows immediately, for if ra = sa (mod n). 
the given condition gcd(a,n) = 1 permits us to cancel a, obtaining 
r = s (mod n). But (0, 1, 2,..., n — 1} is a complete set of residues modulo n 


and so no two can be congruent. 


Since (0, a, 2a,..., (n — 1)a} is a complete set of residues, one of the 
members, say ca, is congruent to 1 modulo n. 


(b) If ax = b (mod n) then certainly cax = cb (mod n), since n|ax — b implies 
n|c(ax — b). 


On the other hand, if cax = cb (mod n) where gcd(c, n) = 1, then 
cancellation yields ax = b (mod n). 


(c 


— 


All the congruences have a unique solution to the given modulus. 
(i) Multiply through by 4: 
20x 244 (mod 19) « x= 6 (modl9). 
(ii) Multiply through by 10: 
30x = 190 (mod29) <= x= 16 (mod 29). 


In each of the two remaining congruences it is not easy to spot which 
integer to multiply by so as to reduce the coefficient of x to 1, although 
the existence of such an integer is ensured by part (a). Undaunted, we 
can still use part (b) to simplify the congruence. 


(iii) Multiply through by 3: 
51x = 18 (mod 49) <> 2x =18 (mod 49). 
The solution x = 9 (mod 49) is now apparent. 
(iv) Multiply through by 7: 
77x = 28 (mod75) <> 2x = 28 (mod75). 
This has solution x = 14 (mod 75). 


(d) There are two errors in the argument. To start with, look at the final step. 
From 2x = 6 (mod 40) we attempted to deduce that x = 3 (mod 40). But 
gcd (2, 40) = 2 and so the former congruence has two solutions modulo 40. If 
we wish to cancel the common factor 2 then the correct deduction is that 
x = 3 (mod 20) which gives the two solutions x = 3, 23 (mod 40). (Notice that 
x = 23 (mod 40) is the correct solution of the original congruence.) This 
explains why the argument failed to produce the correct answer, but still 
does not account for the presence of the erroneous solutions. 


The initial error is in the first step where we multiplied through by 6. Since 
gcd(6,40) 4 1 we cannot do this! Notice the resulting congruence 

42x = 6 (mod 40) has ged (40, 42) = 2 incongruent solutions modulo 40, 
whereas the congruence to be solved has a unique solution modulo 40. Bl 
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SOLUTION (a) Any solution of 4x + 51y = 9 requires 4x = 9 (mod 51) and 51y = 9 (mod 4), 
TO SAQ 16 so we first solve these congruences. 


4x =9(mod51) < 13-4x =13-9(mod51) — x=15(mod51) 
Sly =9(mod4) <> 3y=1(mod4) <=> y=3(mod4) 


So in any solution, x is of the form 15 + 51t and y of the form 3 + 4s and 


to find which values of s and t produce solutions we substitute back into the 
original equation: 


4(15 + 51t) + 51(3 + 4s) = 213 + 204s + 204t = 9 
which gives s+ t = —1. 


The solution set is therefore 


(x =154+ 51t, y= —1— 4t: for any integer t}. 

(b) 12x = 331(mod25) <= 12x = 6(mod25) 

<= 24x =12(mod25) <= x= —12 = 13 (mod 25) 
25y = 331 (mod12) «— y=7 (mod12) 
So x=13 + 25t, y=7 + 12s and 
12(13 + 25t) + 25(7 + 12s) = 331 + 300t + 300s = 331, 
giving s + t = 0. So the solution set is 
{x =13 + 25t, y —7 — 12t: for any integer t}. 

(c) 5x = 17 (mod 53) <= 5x= 70 (mod53) — x= 14 (mod 53) 
53y = —17 (mod5) <>» 3y=3(mod5) — y= 1 (mod5) 
So x=14 + 53t, y=1 + 5s and 

5(14 + 53t) — 53(1 + 5s) = 17 + 265t — 265s = 17, 


so that s = t. The solution set is therefore 


{x =14+ 53t, y=1 + 5t: for any integer t}. NI 


READ B from page 86, line 14 to page 88, line — 12. 


NOTE Page 87, line 12 
The Chinese Remainder Theorem is proved by actually constructing a 
simultaneous solution. This is the integer X appearing in the tenth line of B's 
proof. There are, however, a large number of variables involved and it might be 
helpful to read the numerical Example 4-8 on page 88 at the same time, to help 
keep track of what is what as you read the proof A 


SAQ 17 B page 89, Problems 4.4, number 4, parts (c) and (d). 


SOLUTION (c) To follow the steps of the Chinese Remainder Theorem we first note that 
TO SAQ 17 


4,—5, a4=4, a,=3, n, —6 n,=11, andn,—17 


Then n = n, n,n; = 1122, N, = n,n = 187, N, =n, n, = 102 and 
N, = n, n, = 66. We next solve the congruences N;x = 1 (mod n). 


N,x = 187x = 1 (mod 6) <= x =1(mod6), so x, = 1 (mod 6) 
N x = 102x = 1 (mod 11) — 3x = 1 (mod11) 

has solution x, = 4 (mod 11) 
N,x = 66x = 1 (mod 17) <= 15x =1(mod17) 

has solution x, = 8 (mod 17) 
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Now 
X = 5-187-14+4-102-4+3-66-8 = 4151 
and so the unique solution of the system is 
X = 4151 = 785 (mod 1122). 


(d) There is an initial stage to this problem in that we must first express each of the 
congruences in the form x = a; (mod n;). 


2x =1(mod5) <=> x=3(mod5) 
3x =9=3(mod6) <> x=1(mod2) 
4x 21(mod7) <= x =2(mod7) 
5x =9(mod11) <> xz4(modll) 


To solve the system using the Chinese Remainder Theorem is now just 
(careful) arithmetic. The variable values are 


a,=3, a =1, a,=2, a,—4 


n5, n22, n3=7, m= ll 
n=710, N,=154, N,=385, N,;=110, N,=70 
x, =4(mod5), x, = 1 (mod 2), x, =3(mod7), x4 =3(mod11) 
and finally 


E 


= 3733 = 653 (mod 770). W 


The two examples in SAQ 17 illustrate a drawback in using the method 
suggested by the proof of the Chinese Remainder Theorem to solve 
simultaneous linear congruences, namely that the amount of arithmetic involved 
can become tedious. There is an alternative method. 


Suppose that a system of simultaneous linear congruences fulfils the conditions 
of the Chinese Remainder Theorem. Then we are assured of a unique solution 
modulo n, where n is the least common multiple of the moduli in the system. A 
simple-minded approach would be to find all incongruent solutions modulo n of 
each congruence individually and then pick out the unique common one. How 
we might do this in practice is illustrated by the following. 


EXAMPLE 4 Solve the simultaneous congruences 
x =9 (mod 13), x =3(mod5), x = 1 (mod 3). 


x =9(mod13) = x=9,22, 35, 48, ... (Increase by 13 until we 
| reach a solution of the 
and 
second congruence) 
x=3(mod5) == x = 48, 113,178,... (Increase by 5-13 until 
ad | we reach a solution of the 


third congruence) 
x=1(mod3) == x = 178 (mod 3 - 5- 13) 

The Chinese Remainder Theorem tells us that the first two congruences have a 
unique simultaneous solution modulo 5- 13, which we find to be 48. Amongst 


those integers in the residue class of 48 modulo 65 we seek the least positive 
member which is congruent to 1 modulo 3. 
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In Example 4-9, B uses this method, only he backs up his working with 
algebraic reasoning. 


READ B Example 4-9 on pages 88 and 89. 


This example illustrates one situation in which simultaneous linear congruences 
arise naturally. If n has canonical prime factorization n = p py... p*, then 
solving the linear congruence ax = b (mod n) is equivalent to solving the 
simultaneous system 


ax = b (mod p''),..., ax = b (mod p*). 
The equivalence follows from our Theorem on page 47 which tells us that any 
simultaneous solution of the system must be a solution of the original 
congruence. On the other hand, any solution of ax = b (mod n) certainly 
satisfies ax = b (mod p), since pln, fori <i<r. 
B page 89, Problems 4.4, number 5. 
B page 90, Problems 4.4, number 10. 


We first solve the individual congruences. 
17x 23(mod2) <> x=1 (mod 2) 
17x 23(mod3) — 2x 20(mod3) <> x = 0(mod 3) 
17x =3(mod5) <> 2x =3(mod5) <> x = 4 (mod 5) 
17x =3(mod7) <= 3x 23 (mod 7) <> x=1(mod7) 


x=1(mod2) = x=1,3,... (Increase by 2 until we reach 
aad | x = 0 (mod 3)) 
x =0(mod3) => a 5. AN (Increase by 2 - 3 until we 
and l reach x = 4 (mod 5)) 
x=4(mod5) = x = 9, 39, 69,99, ... (Increase by 30 until we 
and | reach x = 1 (mod 7)) 
x=1(mod7) => x =99(mod210) i 


Let x be the number of gold coins. The given facts translate into the 
congruences 


x =3(mod17), x =10 (mod 16), x =0 (mod 15) 
and our task is to find the least positive simultaneous solution. 


This time we use the algebraic method of Example 4-9. From the third 
congruence we have x = 15k, for some integer k. Substituting into the second 
congruence we have 15k = 10 (mod 16), or 3k =2 (mod 16), which solves to give 
k = 6 (mod 16). Writing k = 161 + 6 we now have x = 15(161 + 6) = 2401 + 90, 
and any number of this form satisfies the second and third congruences. Finally, 
we substitute into the first congruence: 


2401 + 90 2 3 (mod17) 214523 (mod 17) 
which gives | = 16 (mod 17). So writing | = 17m + 16, we have 
x = 240(17m + 16) + 90 = 4080m + 3930 


so that the solution of the system is x = 3930 (mod 4080), and the number of 
gold coins is 3930. g 
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The Fibonacci Sequence Revisited 


In the final section of Unit 2 we studied B Section 13.1, which was concerned 
with the Fibonacci sequence and some of its properties. Amongst other things, 
we proved the following theorem for the particular cases m = 3, 4, 5 and 6. 


For any integer m > 1 there exist infinitely many Fibonacci numbers which are 
multiples of m; these multiples occur with equal spacing in the Fibonacci 
sequence. 


The Corollary on B page 292 went some way towards proving this Theorem, as 
it showed that if one multiple of m occurs in the Fibonacci sequence then 
infinitely many do, with equal spacing. To complete the proof it remains to 
show only that one multiple of each integer m 2 1 occurs as a Fibonacci 
number. This is the goal of the section, but to achieve it we first consider a 
different approach to the Fibonacci sequence which, as a by-product, discovers 
new proofs of the divisibility properties and new identities between Fibonacci 
numbers. Everything hinges on the following result: 


Let A be the matrix A = ( | 


1 i} Then for all integers n = 1, 


Wü. u ; y 
A" = h 2 i. f where u, are the Fibonacci numbers, and u, = 0. 
n n+1 


(by Mathematical Induction) 


Since uy = 0 and u, = u, = 1, the result is certainly true when n = 1. 


Suppose that A* = Ei Bi or some integer k > 1. Then, 
k k+1 
ARYL = AAR = 0- Ly fuc "714 
1 1) (u Ui 


- i: Ua | = (i ab 
Uy FU M E Hat Wei Un+2 
which is of the required form. The result follows by the First Principle of Finite 
Induction. NH 


This apparently simple result is in fact rather powerful and leads to simple proofs of 
Fibonacci identities. For example, since A?" — (A" we have that 


2 2 2 
P) Un j= kee u, | iz xy tu, VES 
+1 


2 2 
Un Uni U, Un Uy — iUn Mei Up + Unta 


SO Uppy = U2; +u? and uz, = Up- 1M, + usu, s, for all integers n Si, 
Similarly, from the matrix identity A™*" = A". A" equating elements in the first 


row and second column gives the identity Um+n = Um—1Un + UmUn+ 1> whose 
inductive proof on B pages 289 and 290 was far from straightforward. 


CET —1 1 
Notice that the matrix A has multiplicative inverse ACL- | ^ of (You can 
check that AA! = l 4 =I, the identity matrix.) It follows that for each 


integer n > 1 the matrix A” has an inverse, namely (A +)". As is usual when 
dealing with exponents we abbreviate (A^! by A^", and take A? to be I. (The 
set of all powers of A, (A": ne Z}, is an infinite cyclic group in which the usual 
laws for exponents hold.) 
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In the SAQ that follows we obtain an explicit formulation for A^" in terms of 
Fibonacci numbers, and use the result to obtain more F ibonacci identities. 


(a) Prove, using Mathematical Induction, that A^" = (—1y [i 3 2 for all 
integers n> 1. PT 
(b) From (a), deduce the identity u2 — Un—1Uy4, = (71) 3, forn >= 1. 
(c) Prove that A" + (C 1» A^" = (u,., + u, ,)L fornz 1. 
(d) By multiplying both sides of the equation in (c) by A", show that 
isis (D tpn = (a F Uy 3) ty 
for all positive integers m and n with m > n. 


"E Sk l| ra bosser u 
woot Jeena 


—u i 
if so the result is true when n = 1. 
Uo 
Assume that the result is true for some integer k > 1, ie. 


Aes (A7! =(—1%} | Ui a! 
=U Uki 


Now 


= =) od 
~O+1) A-kA-i. (. «| "eei Uy, 
A A (—1) | 10 


T U- 


2 (-1y — (uua + uy) Ue 
Up. —u, 


m T Uy, 


=(=" | Ue 2 Fo 


which is in the required form, completing the proof. 


(b) arate (1 Unti Du, he L^ | 
m^ Un— 1} \Un Unta 
=(-17 Uniuni — Up 0 
= 0 Une qH, 1 — ue 
int Uy — 4 n 


But A^" A" — I and so (—1)'(u,,,u, , — u.)- 1, which amounts to the 
required identity. 


[To prove this identity by traditional methods is quite involved; see B page 294. ] 


u u u, —u 
n —1yA^"- qe n —1yv(—1y ntl n 
(c) A" + (-1J'A e P st CoD Ce ». n 
NELLE 0 
0 Uni Mua 


= (u,-, + wa) 
(d) Multiplying through the equation in (c) by A” produces 
APT 4 (1) A” = (ui, + thy) AT. 


Now equating the elements in the first row and second column of the 
matrices on either side of the equation gives the required formula. $ 


For each integer n, positive or negative, we have seen that the entries in the 
matrix A” are all integers, so that in this work so far we have dealt exclusively 
with the set of 2 x 2 matrices whose entries are in Z, which we denote by M(Z). 
At this point we are going to bring congruence into play. Choosing an integer 
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m > 1, we switch tack by considering matrices in M(Z,,), the set of 2 x 2 
matrices whose entries are residue classes modulo m. With each matrix in M(Z) 
we associate, in a natural way, a matrix in M(Z,,) by replacing each entry by its 
congruence class modulo m. For example if m — 4, then the matrix associated 

c 5o cV qi L-1] ft] B] 
with P A is E 7 = [s 2 It seems natural to use the 
notation [A] for the matrix associated through congruence modulo m with A. 
(If you have a phobia for the square bracket notation they can be dropped 
under the convention of always working with the set of least positive residues 
modulo m, replacing each integer by the unique residue to which it is congruent. 
In subsequent units we shall do this, but for conceptual clarity, we shall persist 
with the bracket notation in this section.) 


You may well be wondering what we hope to gain by introducing this new set 
of matrices. To signpost the way, recall that our goal is to show that for any 
positive integer m there is a Fibonacci number u, such that mlu,, that is, 

[u] = [0]. We shall achieve this by finding an integer t such that [A'] = [I], 
whereupon comparison of the elements in the first row and second column gives 
the required result. 

First we need the following result which essentially says that matrices are 
multiplied in M(Z,,) in the normal way, but using the residue class arithmetic. 


If X and Y are matrices in M(Z) then [X] [Y] = [XY]. 


Let X = E | and Y = i 7? Then 
at Va 


[XY] [E 11 + Xs Xıy2 + 2 
X3y, + XaYa X3V2 + Xay4 

“bos +X¥3] [V2 + xa] 

[x31 + X4¥3] | [xa + X44] 


mm lbi] + [x2] bs] [x] D2] + [x2] 55) (by definitions of 
[x3] [y1] + [x4] [y3] [x3] [y2] + [x4] Da] addition and 
multiplication in Zm) 


| (by definition of [A ]) 


[x1] [x2] 
[x3] [x4] 


fee [y2] 


= Yi. 
[1 5 AUS. M 


Use the result of the Lemma to show that [A ^"] is the multiplicative inverse of 


[A"] in M(Z,,). 


To talk about multiplicative inverses we first need an identity element in M(Z,). 
This is plainly [I], which is confirmed by the Lemma: 


K] [1] = XI] = [X] = M] = [I] [X]: 


Then. 
[A^] [A "] = [AA "] = [I] = [A "AÀ"] = [A "] [A"] 
shows that [A "] = [A^] - M 


And now for the main result. 
If m is any positive integer then there exists a Fibonacci number u, such 
that mlu,. 


We first remark that each matrix in M(Z,,) has four entries and that each of 
these entries can take only the m values [0], [1],..., [m — 1], so there are in all 
m^ distinct matrices in M (Zm). 
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Now consider the matrices 
[A], [A], [A°],..., [A"],.... 


As there are only a finite number of distinct matrices these cannot all be 
different. So, focussing on an equal pair, there exist positive integers s and t 
such that [A*] = [A**']. That is, 


[A*] = [A*] [A']. 
Multiplying both sides on the left by [A~*] gives 
[A'] = [I]. 
Spelling this out, 


Ded [4] )- (03 w 
[4] — [w1] 0]. [1] 


from which we see that [u,] — [0] or, what amounts to the same thing, mu. WM 


Having developed all this machinery it would be a pity not to give further illustration 
of its powerful applications. To this end, we close the section with an alternative 
proof of the main Fibonacci divisibility property, namely that u, |u,,, (Theorem 13-2, 
B page 290). To do this we change the modulus of our congruences to u,,, so that we 
consider matrices in M (Zun): In what follows, therefore, the square brackets refer to 
residue classes and associated matrices modulo u,,, rather than modulo m as 
previously. 


(a) Show that [A"] is a diagonal matrix and hence determine [A"", where m 
and n are positive integers. 


(b) Use the fact that [A"^] = [A"" (which follows from the Lemma) to show 
that u,[u,,. : 


(a) ies ren [un ] | i-a [0] | 


[um] [Mn+ 1] [0] [0,1] 
(remember we are now taking congruences modulo u). 


mm [m] 01 \*_ (Du. [0] 
Ie b all » Ts en 


(Note further that since u,,,, = Um + Uy — 4, We have [Uy,41] = [u, ,] so 
[A"]" is a scalar multiple of the identity matrix.) 


(b) [A""] = ie! [umn] | 


[imn] [tmnt] 


Comparing entries in the first row and second column of [A""] and [A"" 
we have [u,,,,] = [0]. That is, unu, | 


Problem Solving 


On the tape cassette accompanying this section we discuss the problems given 
below. If you feel confident that you can handle these problems without 
assistance, then by all means go ahead. (Solutions are given in the Exercise 
Booklet.) However, it should be emphasized that the tape contains much more 
than just solutions to the problems, and real benefit could be gained from the 
discussion of the various ways of attacking the problems. In particular, the real 
purpose of Problem 2 is to suggest a possible strategy for solving linear 
congruences in general. 
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START THE TAPE WHEN YOU ARE READY 


An integer a is both a square and a cube. Prove that 
a= 0,1,9 or 28 (mod 36). 

Solve the following linear congruences. 

(i) 28x = 49 (mod 53) 

(ii) 34x = 19 (mod 52) 

(iii) 137x = 96 (mod 19) 

(iv) 51x = 8 (mod 53) 

(v) 46x =74 (mod 98) 

Solve the simultaneous system of congruences 
2x + 3y = 1 (mod 100), 

3x — 2y = 1 (mod 100). 


Summary 


This unit has been concerned with the topic of congruence, whose applications 
we continue to investigate in the next unit and thereafter use extensively in the 
remainder of the course. 


Section 3.1 looked at basic properties of congruence. We found that for each 
positive integer n there is an arithmetic modulo n which has its own 
peculiarities. Theorem 4-2 and Theorem 4-3 of B give the important rules 
governing addition, multiplication and cancellation in arithmetic modulo n. 


Section 3.2 turned to simple applications of congruence and, in particular, 
established tests for divisibility by certain integers. 


Section 3.3 was concerned with solving linear congruences and solving systems 
of simultaneous linear congruences. Theorem 4-7 and the Chinese Remainder 
Theorem gave the respective conditions for solvability, but the section suggested 
alternative methods of solving each kind of problem. 


Section 3.4 returned to the Fibonacci sequence and gave an alternative 
approach to proving properties of the sequence which used matrix techniques. 
The power of this approach was illustrated by the comparative ease with which 
some of the seemingly difficult properties were derived. 


The main definitions and results encountered in this unit are listed in the 
corresponding entry in the Handbook. 
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Introduction 


The main result of this unit is Fermat’s Little Theorem, not to be confused with 
the probably better known Fermat’s Last Theorem which we shall encounter in 
Unit 8. Fermat’s Little Theorem states that, when working to prime modulus p. 
x?~! is congruent to 1 for every integer x which is not divisible by p. 


The result has numerous applications, one of the more important of which is its 
role in the area of solving polynomial congruences, which are congruences of 
the form P(x) = 0 (mod n) where P is a polynomial with integer coefficients, We 
begin an investigation of polynomial congruences and their solution in 

Section 4.4. We shall pick up this thread again in Unit 6 where we carry out a 
systematic study of the solution of quadratic congruences. 


In Section 4.1 we meet an ingenious algorithm for factorizing odd integers, 
discovered by Fermat. Section 42 is concerned with the proof and some simple 
consequences of Fermat's Little Theorem. Section 4.3 introduces another of the 
classical results in this area of number theory, known as Wilson's Theorem. 
Section 4.4 turns to polynomial congruences and is a tape-based section. 
Finally, in Section 4.5 we take another look at Fermat's and Wilson's 
Theorems. From their statements, any link between the two theorems 

must appear tenuous. However, we shall observe certain similarities in their 
proofs, and in this final section we illustrate how the two results arise out of the 
solutions of two similar, well-known combinatorial problems. 


The unit is based mainly on Chapter 5 of B. Section 44, however (most of 
whose material will not be found in B), culminates in a short reading passage 
from Chapter 8. 


Fermat's Factorization Method 


To investigate the primality of an integer n, the only method so far presented is 
to test n systematically for divisibility by each prime not exceeding n, in turn. 
In this section we shall meet an alternative method: an algorithm discovered by 
Fermat. Rather than requiring a knowledge of prime numbers, Fermat's method 
relies on a knowledge of the perfect squares. 


But first we learn a little about Fermat himself, one of the great men of number 
theory. 


READ B Chapter 5 from page 92 as far as the end of Section 5.2. 


Whilst there is a certain elegance about Fermat's method, it must be said that it 
is not a very efficient algorithm for factorizing a general integer n. As pointed 
out in B, there are favourable cases for Fermat’s Method, occurring when n is a 
product of two numbers of approximately equal size. But in the vast majority of 
cases, which are unfavourable, the amount of computation needed to find 
factors of n by Fermat's method far exceeds that involved in testing n for 
divisibility by each positive integer not exceeding Jn. 

B page 96, Problems 5.2, number 1, parts (a) and (b). 


[Note: B contains a list of squares, Table 5 in the Appendices. ] 


(a) The smallest positive integer k for which k? > 2279 is 48. 
k? — 2279 = 2304 — 2279 = 25 = 52 
Therefore 2279 = (48 + 5)-(48 — 5) = 53 - 43. 
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(b) 102? < 10541 < 1037, so we first take k = 103. Writing n = 10541, 
103? — n= 68, 
104 —n= 275, 
105? — n = 484 = 227. 
Therefore n = (105 + 22).(105 — 22) 2127.83. W 


Fermat’s Little Theorem 


The subject matter of this section is one of the classical results of number 
theory: Fermat's Little Theorem. The theorem is a special case of a more 
general result, Eulers Theorem, which we shall encounter in Unit 5. The 
usefulness of Fermat's and Eulers Theorems in number theory is difficult to 
exaggerate. 


READ B Section 5.3, page 97, as far as page 99, line — 5. 


Page 98, line 8 

In SAQ 15 of Unit 3 we proved that {0,a,2a,..., (n — 1)a] is a complete set of 

residues modulo n, provided gcd(a, n) = 1. It follows that for p prime and p/a, 

the integers a,2a,3a,..., (p — 1)a are congruent modulo p, in some order, to the 
non-zero elements in the set of least positive residues modulo p. A 


To further illustrate usage of Fermat's Theorem, let us look at: 


B page 102, Problems 5.3, number 10. 


(a) For ae(1,2.3....,p — 1}, pya and so Fermat's Theorem tells us that 
a?! = 1 (mod p). 
Therefore, adding these p — 1 congruences, 


12714227143»... (p-1?"z1-1-1-c---1 


=p—1=-—1 (mod p). 
(b) By the Corollary, a? = a (mod p) for all integers a. Therefore 


1242? 43? 4---+(p—1P? =14+24+34+---+(P-1) 


1 
==p-(p-—1), 
ae) 
mp. i * "| E A is an integer, 
since p is odd 
= 0 (mod p). 


Find the remainder when 7*° is divided by 17. 
B page 101, Problems 5.3, number 8. 


By Fermat’s Theorem, 716 = | (mod 17). Therefore 

7*9 = (7116? .78 = 17-78 = 7? (mod 17). 
Now 7? = 49 = —2 (mod 17), and so 

7^9 = 78 = (T? = (—2)* = 16 (mod17) 


and the remainder on dividing 7^? by 17 is 16. B 
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(a) If gcd(a, p) = 1 then pya and so a~! = 1 (mod p). Hence if x = a?-?b then 
ax = a: a^? b = q"^! b = b (mod p), showing that a^^? is a solution of 
ax = b (mod p). 


COMMENT In the notation of Z, (where the non-zero elements form a group 
under multiplication), Fermat's Theorem says: 


[a1] = [1], whenever [a] + [0]. 
That is, 
[a] [a^^] = [1], whenever [a] # [0], 


so that the non-zero element [a] has multiplicative inverse 
[a]! = [a^7?]. 
The linear equation [a] [x] — [b] ([a] ¥ [0]) has unique solution in Za 
[x] = [a] * [b] = [a”-?] [b]. 


(b) Since gcd 2, 31) = 1, the congruence 2x = 1 (mod 31) has a unique solution 
modulo 31 which, by part (a), is x = 229-1 = 22? (mod31). Now 


27? = (25)5 .24 = (3255 -24 = 15.2* = 16 (mod 31) 


and so x = 16 (mod 31) is the unique solution. 


Similarly, 6x = 5 (mod 11) has unique solution x = 6°-5 (mod 11). Now 
6 = —5 (mod 11) and so 


x= 69-5 =(—5)9-5=-1-5% = 1 (mod 11) 
since 5'° = 1 (mod 11) by Fermat's Theorem. 


The unique solution of 3x = 17 (mod 29) is 
x = 377.17 = 3327-17-30 (since 30 = 1 (mod 29)) 
= 37§.17-10 = 17-10 = 25 (mod 29) 
where we have used 37° = 1 (mod29) gi 


READ B from page 99, line — 4 to the end of Section 5.3. 


(i) Page 100, line —9 
The final step in the proof of the lemma follows immediately from our 
theorem on page 47 of Unit 3. It is worthwhile recalling the theorem at 
this stage for we shall appeal to it several times in what follows. 
(ii) Page 101. line 3 
An alternative way of obtaining the result 234" = | (mod 341) is as follows. 
By Fermat’s Theorem we know that 2!° = 1 (mod 11), and so 
2940 = (21)34 = 134 = 1 (mod 11), 
Further, since 2° = 32 = 1 (mod 31), we have 
2340 = (25)58 = 158 = 1 (mod 31). 
The theorem cited in Note (i) now gives 234° = 1 (mod 11 - 31). 
(iii) Page 101, line 12 
The number 561 is interesting in that, despite being composite, it has the 
property a°°° = 1 (mod 561) for all integers a which are relatively prime to 


561 (see SAQ 4). This shows that even if a’~1 = (mod n) for all integers a 
with gcd(a, n) = 1, it is still not necessarily the case that nis prime. A 


The next example further illustrates how we can use Fermat’s Theorem in 
conjunction with the theorem of Note (i) to obtain results about divisibility by 
composite integers. 
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XAMPLE 2 B page 101, Problems 5.3, number 2. 


(a) To show that at? = 1 (mod 35) it suffices to show that a'? = 1 (mod 5) and 
a‘? = 1 (mod7). 


Since gcd (a, 35) = 1 we known that 5a and 7/a, so that Fermat's Theorem 
gives a* = 1 (mod 5) and a? = 1 (mod 7). Therefore 


at? = (a*? = 1? = 1 (mod 5), 
a’? = (a6 = 1? = 1 (mod7), 
as required. 


(b) To prove 168|a° — 1 we shall show that aê = 1 (mod 3), af = 1 (mod 7) and 
aô = 1 (mod 8). Since 3, 7 and 8 are relatively prime in pairs it will follow 
that af = 1 (mod3-7-8). Given that ged(a,42) = 1 we know that 2/a, 3a 
and 7/a. From the latter, Fermat's Theorem gives aê = 1 (mod7). From 
3a Fermat’s Theorem gives a? = 1 (mod 3), whence 


a6 = (a?)? = 1? = 1 (mod 3). 


Finally, from 2a we have that a is odd and so a? = 1 (mod 8) (see B page 
22, third paragraph). But then 


a® = (à? = 1 (mod 8). 


(c) 133 = 7-19 and so ged(a, 133) = 1 implies that 77a and 19/a. Employing 
Fermat's Theorem, 


a!8 = (a°)3 = 13 = 1 (mod7) and a!? = 1 (mod 19) 
leading to the conclusion that at = 1 (mod 7-19). Similarly. 
b!8 = 1 (mod 7-19), and so a! — b!’ = 0 (mod 7-19). That is, 
133|at8 = 518: 
SAQ 4 Show that a6? = 1 (mod 561) whenever gcd(a, 561) = 1. 
SAQ 5 B page 101, Problems 5.3, number 4. 


SAQ 6 For any integer a show that à? and a have the same units digit. Deduce that 
a!9? has the same units digit as a’. 


SOLUTION 561 —3.11-17 and so if gcd(a, 561) = 1 then 3/a, 11a and 17/a. Fermat's 
TO SAQ 4 Theorem gives a? = 1 (mod 3), at? = 1 (mod 11) and a!$ = 1 (mod 17). 
Therefore, 


a56? E (a? y 80 


1289 — J. (mod:3); 
560 — (g10)56 = 156 = 1 (mod 11), 

4599 = (919935 = 135 = 1 (mod 17), 
whence a°°° = 1 (mod3.11.17) BM 


SOLUTION (a) By the Corollary to Fermat's Theorem we know 
TO SAQ 5 


a 


a =a(mod3) and a> za (mod5). 


Employing these facts in turn, 


a? = (a)! 2 a! = (œ Ya = &a = æ =a (mod 3) 


and 
a?! =(a)*a = ata = à? = a (mod 5). 
It follows that a?! = a (mod 3 - 5), as required. 
(b) By the Corollary to Fermat’s Theorem we know 


a? =a(mod2), a= a (mod3) and a’ =a (mod?) 
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The first two give, respectively, 


a! = (à Pa = asa = (PP = a (mod 2) 


and 
a’ = (a= a^a = à? =a (mod3). 
Now a’ = a (mod2-3-7), as required. 


(c) (Similar to (b).) From a? = a (mod 3) we deduce that 


at? = (a?)*a = ata = ja? = ag? = i? a (mod 3) 


and from a’ = a (mod 7) we obtain 


T 


a^? = a'a* = aa = a? =a (mod 7). 


Not forgetting a‘? = a (mod 13), we conclude that a!? = a (mod3.7.13) m 
Two integers have the same units digit if, and only if, they are congruent 


modulo 10. For any integer a we have à? = a (mod 5) and à? = a (mod2). From 
the latter, 


à? = (a?f'a = aa = @ =a (mod 2), 
and therefore a = a (mod 10), as required. 
100 _ 20 


9100 = (PP =a (since à? = a (mod 10)) 


(a°)* = a* (mod 10) 


and so a‘°° has the same units digit as a^. [| 


Consider an ordinary pack of 52 playing cards in any order. In the perfect faro 
shuffle the pack is cut into two equal halves which are then placed (flicked) 
alternately, as illustrated. 


er Car] 
[T1] Eta] 
[2875] 
e e * 
: — c n —- Us 
i [31 ] x 
52 26 ] ESZ] 
L3 ] 


After how many such shuffles will the pack return to its original order? 


The cards which initially occupy positions 1,2,...,26 are moved to positions 
2,4,...,52, whilst the cards initially in positions 27,28,... , 52 are moved to 
positions 1,3,...,51. Thus the card starting in position x ends up in position y 
where 1 X y < 52 and y = 2x (mod 53). After n such shuffles this card will 
occupy position 2"x (mod 53). 


If all cards return to their initial positions after n shuffles then 
2"x = x (mod 53), for all x such that 1 < x < 52. 
As 53 is prime we may cancel x to obtain 


2" = 1 (mod 53). 
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Now Fermat's Theorem gives us one positive value of n which satisfies this 
congruence: 2°* = 1 (mod 53). Hence after 52 shuffles the pack is restored to its 
original order. 


In fact n = 52 is the least positive solution of 2" = 1 (mod 53) (see SAQ 7), so 
the pack returns to its original order for the first time after the 52nd shuffle. 


Suppose now that we take a pack of m cards (where m is even) and apply 
perfect faro shuffles as before. The rules governing the movement of the cards 
are just as in the above example. After n shuffles the card which originally 
occupied position x is moved to position y where 1 < y < m and 

y = 2"x (mod m + 1). The pack returns to its original order after n shuffles if 
and only if 


2"x = x (mod m + 1), for all x such that 1 E x € m. 
Take for example the case m — 62. Since 


2° = 64 = 1 (mod 63) 
we have 
2°x = x (mod 63) for all x such that 1 < x < 62. 


Therefore, for a pack of 62 cards, after only 6 shuffles the original order is restored. 
Let n = ny be the least positive solution of 2" = 1 (mod 53). Use the Division 


Algorithm to show that nj|52. Confirm that 27° # 1 (mod 53) and deduce that 
ng = 52. 


Two jokers are added to an ordinary pack of playing cards. After how many 
perfect faro shuffles does the resulting pack of 54 cards return to its original 
order? 


From the Division Algorithm we can write 52 = qno + r, where 0 € r < ng. Now 
25? = 2" = 1 (mod 53) and so 


1 = 25? = Qmotr = (24. X = 14.27 = 2" (mod 53). 


Hence r = 0, for otherwise we contradict the definition of n, as least positive 
integer with this property, and therefore n|52. 


The possible values of ny, being a divisor of 52, are 1, 2, 4, 13, 26 or 52. The 
first three of these can be eliminated at once since clearly they do not satisfy 
2" = 1 (mod 53). Turning to 26, 

226 = $5.2 = 114-4 = (117)? -4=15-15-4=15-7 1 (mod 53). 
Finally, suppose 2'? = 1 (mod 53). Then 27° = (213? = 1? = 1 (mod 53), a 
contradiction. We conclude that nọ = 52. W 


The pack of 54 cards returns to its initial order after n shuffles provided 
2"x = x (mod 55), for all x such that 1 € x € 54. Since ged (2^, 55) = 1 for all 
n= 1, x may be cancelled and the required condition is 2" = 1 (mod 55). 


Now 55 — 5 11, and by Fermat's Theorem we know that 2^ = 1 (mod 5) and 
21? = 1 (mod 11). Therefore 


229 = (24 = 15 = 1 (mod 5) 
and 
220 = (219)? = 1? = 1 (mod 11), 
and so 2?? = 1 (mod 5.11), giving n = 20 as a solution to our problem. 


If there is a smaller solution than n = 20 then, as in SAQ 7, it must be a divisor 
of 20. But 21° = 1024 = 34 (mod 53), and the other divisors are easily 
eliminated. B 
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Wilson's Theorem 


In the proof of Fermat's Theorem we compared the product of the elements in a 
complete set of residues modulo p, excluding the element which is congruent to 
0, with the product of the numbers 1,2,..., p — 1. Observing that the two 
products are congruent modulo p, the latter being (p — 1)!, led to the result. 
More careful examination of the product 1-2-3...(p — 1) reveals that, with the 
exception of 1 and p — 1, the elements in the product break into disjoint pairs 
in such a way that the product of each pair is congruent to 1 modulo p. From 
this it is a short step to deduce that (p — 1)! = —1 (mod p), a result known as 
Wilson's Theorem. 


READ B Section 5.4, page 102, as far as page 104, line — 11. 


Page 103, line 16 

In SAQ 3 we showed that the unique solution x — a' of the congruence 

ax = 1 (mod p) is given by a’ = a^^? (mod p). In terms of Z,, the non-zero 
elements ([1], [2]..... [p — 1]) form a group under multiplication and so each 
element has an inverse, and [a]! = [a'] = [a?~?]. The next part of the proof 
observes that, with the exception of [1] and [p — 1], each element is distinct 
from its inverse, whereas [1]! = [1] and [p^ 1] = [p- 1 A 


As an illustration of the use of Wilson's Theorem let us look at: 
B page 107, Problems 5-4, number 9. 
Working modulo p, where p is an odd prime, we have 
= —(p—2),4= —(p—4),...,. 2k = —(p—2k),.... p-1s-—1 


So the even integers 2,4,6,...,p — 1 are congruent modulo p respectively to the 
negatives of the odd integers, — (p — 2), — (p — 4),...., —5, —3, — 1. Taking their 
product, 


2-4-6...(p—1) = -1- —3: —5... -(p- 2) 
= (—1Y2 1.3.5... (p— 2) (mod p). 
Now utilize Wilson's Theorem: 
—12(p—1) 21:2-3...(p—2)(p — 1) 
= (1-3-5...(p — 2) (2°4°6...(p — 1)) 
= (—1) F (1.3.5... (p — 2)}? (mod p). 
Multiplying both sides by (— iz. we conclude that 
63.9. (p— dfs (_A 2 odp 
B page 106, Problems 5.4, number 3. 


B page 106, Problems 5.4, number 1. 
B page 106, Problems 5.4, number 4. 


The above note suggested a systematic, though rather cumbersome, method of 
solving this problem. For each integer a we determine a^! from 

a~' = a? ^? (mod p). However, you should have been able to ‘spot’ the ten pairs 
corresponding to p — 23, particularly after making the following observation. If 
ab = 1 (mod p) then (—a)(— b) = 1 (mod p), and so (p — a) (p — b) = 1 (mod p). 
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For example, seeing that 4-6 = 1 (mod 23), we deduce immediately that 
(23 — 4)-(23 — 6) 2 19-17 = 1 (mod 23). 
From the divisors of 24 (=1 (mod 23)) we have the pairs 
2.122 3.824.621 (mod23) 


and these give, as above, 
21-11 = 20.15 2 19.17 = 1 (mod 23). 
Since 70 = 1 (mod 23), we have the further pairs 
5.14 2 7-10 = 1 (mod23) 
and, subtracting from 23, 
18.9 216.13 = 1 (mod 23), 
completing the list. Bi 


(a) In the proof of Wilson's Theorem we established that (p — 2)! = 1 (mod p), 
for p prime. Putting p = 17 gives 15! = 1 (mod 17). That is, the remainder 
on dividing 15! by 17 is 1. 


(b) If p > 3 is prime, then 
2(p — 3)! = —(p — 2)(p — 3)! = —(p — 2)! = — 1 (mod p). 
Putting p = 29 gives 


2.26! = —1 (mod 29) 
and so the remainder on dividing 2.26! by 29 is 28. BW 
437 = 19-23. To show that 18! = —1 (mod 437) it is sufficient to show that 
18! = —1 (mod 19) and 18! = —1 (mod 23). The former is given immediately 


by Wilson’s Theorem. 
Working modulo 23, Wilson’s Theorem gives 
—1 = 22! = 18!-19-20-21-22 
= 18!-(—4)-(—3)-(—2)-(—1) = 18!-24 = 18! (mod 23). B 


In the final reading passage of this section, quadratic congruences are 
introduced. As we shall shortly see, the theory of quadratic congruences has 
great importance in number theory, and Unit 6 of this course is devoted to their 
study. The result of Theorem 5-3 is crucial in this context and we shall refer 
back to it several times in the later units. 


READ B from page 104, line — 10 to the end of Section 5.4. 


Page 106, line 10 

Wilson's Theorem tells us that p|(p — 1)! + 1. Putting n = p — 1 gives 

n+ ln! 4- 1. As n! -- 1» n ^ 1 provided n > 2, we conclude that n! + 1 is 
composite whenever n + 1 is a prime greater than 3. A 


For an application of Theorem 5-3 look at 


B page 107, Problems 5.4, number 12. 


This exercise gives us a partial result en route to solving a classical problem, 
which we shall look at in more detail in Unit 8, namely, ‘which positive integers 
can be expressed as the sum of two squares? For example, 4 — 2 0, 

5 — 22 + 1? and 13 = 2? + 3? can be so expressed. If you experiment with the 
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numbers in the range 1 to 10 inclusive you will find that the only ones which 
cannot be expressed as a sum of two squares are 3, 6 and 7. Notice that each of 
these has a prime factor of the form 4k + 3. 


The result of the present exercise can be re-interpreted as follows. If p — 4k 4- 3 
is prime and if a multiple of p can be expressed as a sum of two squares 

(a? + b? = 0 (mod p)), then pla and p|b. But we can take this an immediate step 
further, for if pla and p|b then p?|a? + b?. This means that an integer which is a 
multiple of p but which is not a multiple of D^ cannot be expressed as a sum of 
two squares. This accounts for the failure with integers 3, 6 and 7. 


Having sketched the background to the problem, its solution presents little 
difficulty, thanks to Theorem 5-3. 


Suppose a? + b? = 0 (mod p) and assume pa. As in the proof of Wilson's 
Theorem, a has an inverse, that is, an integer c such that ac = 1 (mod p) 
Multiplying through the original congruence by c? gives 


ac? + be =0-c? 20 (mod p) 


or 
1 + (bc = 0 (mod p), 


telling us that bc is a solution of 1 + x2 20 (mod p). But this congruence has 

no solutions, by Theorem 5-3. The existence of the integer c is thereby 

contradicted, and we must have pla. Similarly. by the symmetry of the situation, p|b. 
SAQ 12 B page 107, Problems 5.4, number 11. 


SAQ 13 B page 107, Problems 5.4, number 10. 


SOLUTION From the proof of Theorem 5-3 we know that, for p = 1 (mod 4), the 


a "ERI ; 
TO SE T congruence x? = —1 (mod p) has solution x = i ! (mod p). Further, since 


(—1)? = 1, it is clear that x = — zu ! (mod p) is a second solution. These 


two solutions are incongruent modulo p (for x = —x (mod p) leads to 
x = 0 (mod p), which is not the case), and as we shall discover in the next 
section these two are in fact the only solutions. 


Our task is therefore to simplify the values + e7 ! (mod p) for the cases 


p = 29 and p = 37. 


—1 ' er 5 
For p = 29, ez ! = 14! Working modulo 29 we can simplify 14! by pairing 


the terms: 
14! = 1. (2-14): (3 - 10) - (4-7) - (5-6) (8-11): (9 -13)-12 
=1-—1-1-—1-1-1-1-12 = 12 (mod 29). 


So two solutions of x? + 1 20 (mod 29) are x = 12 (mod 29) and 
= —12 = 17 (mod 29). 
For p = 37 the required solutions are x = +18! (mod 37), which simplify to 


x = 6, 31 (mod 37). [It is not unreasonable to ‘spot’ the solution 6 here, giving 
—6 = 31 (mod 37) as the second solution.] BN 


SOLUTION (a) Retrace the argument on B page 105, starting at line 6. For the case 
TO SAQ 13 p = 4k + 3, the argument differs on line — 5 where (—1)9 ^? = —1, giving 


E 
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—1 , 
We conclude that E ! satisfies the congruence x? = 1 (mod p). But 


x = c1 (mod p) are the only solutions of this congruence (see the second 
paragraph of the proof of Wilson's Theorem on B page 103), and so either 


CXII E 


| ! = —1 (mod p). 
(b) Consider the product of all the even integers less than p: 
-1 
2-4:6...(p—1)— 30-0/2.1:9.3... Ez 


2 
—1 
= 2P- 1)/2. PTA ! 
ES 


= £2?" (mod p) by part (a) 
Now Fermat's Theorem tells us that 27^! = 1 (mod p), and so 
(22-012)? = 1 (mod p). 


But the only solutions of x? = 1 (mod p) are x = + 1 (mod p) and so 
20-9? = + 1 (mod p). Therefore 


2:4-6...(p —1) = +1 (mod p = 4k + 3) 


as required. BN 


Polynomial Congruences 


What values of x satisfy 
x29 + 3x1^ + 8x1? + 3x? + 6 = 0 (mod 7)? 


To solve a congruence such as this may, at first sight, appear to be an awkward 
task. But Fermat’s Theorem leads to immediate simplifications. 


If x = 0 (mod 7) then the left-hand side of the congruence is clearly seen to be 
congruent to 6 modulo 7, and so no x which is a multiple of 7 can be a solution 
of the congruence. Therefore in looking for solutions we may assume that 74x, 
which permits us to apply Fermat's Theorem, namely that x? = 1 (mod7). Then 
x29 = (x$5 . x? = 13. x? = x? (mod 7). 
Similarly, x!^ = x? (mod 7) and xt? = x* (mod 7). Therefore solving the original 
congruence is equivalent to solving 
x? + 3x? + 8x4 + 3x? + 6 = 8x* + 7x? + 6 = 0 (mod 7). 
Simplifying further by reducing the coefficients (8 = 1 (mod 7), etc.), we arrive at 
xt — 1 = 0 (mod7). 


Checking the fourth powers of +1, +2 and +3 we find that this congruence 
has just two solutions, namely x = +1 (mod 7). These two are also the solutions 
of the original congruence. 


This example illustrates an important role played by Fermat's Theorem towards 
the solution of ‘polynomial congruences’, enabling us to replace the polynomial 
in question by an equivalent one of smaller degree. In this section we are going 
to be wholly concerned with polynomial congruences and their solutions. We 
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shall discover that when the modulus of the congruence is composite, 
unexpected and unwanted things happen. But when the modulus is prime the 
situation is more clear-cut. 


The culmination of the section is a theorem of Lagrange which gives an insight 
into the number of solutions that a polynomial congruence with prime modulus 
can have. 


START THE TAPE WHEN YOU ARE READY 


Definitions 
A polynomial congruence is an expression 

P(x) = a,x" + a,_ x7) 4 +++ + a,x + ay = 0 (modn) 
where P is an integral polynomial (that is, a polynomial whose coefficients are 
integers). 
Integer b is a solution of P(x) = 0 (mod n) if and only if P(b) = 0 (mod n). 
The number of solutions of P(x) 2 0 (mod n) is the number of incongruent 


solutions modulo n. That is, the number of solutions in a complete set of 
residues, for example {0,1,2,...,n — 1}. 


Degree 


The degree of the polynomial congruence P(x) = 0 (mod n) is defined as the 
highest exponent in P whose coefficient is not congruent to 0 modulo n. 


Examples 

(i) 3x° — x? + 2 = 0 (mod 4) has degree 6 

(i) 5x* + 6x? + 17x = 0 (mod 5) has degree 3 
(iii) 6x? + 3x? + 9x + 2 = 0 (mod 3) has degree 0 
(iv) 7x? — 14x + 7 =0 (mod 7) has degree ? 


A polynomial all of whose coefficients are congruent to 0 modulo n (the zero 
polynomial over Z,) has no degree associated with it. 


Examples of solutions modulo 3 


P(x) P(0) P(1) P(2) Solutions of P(x) = 0 (mod 3) 
x? = 1 2 0 0 1,2 

+1 1 2 2 None 

x+x4+1 1 0 1 1 

4x? 42 a 0 0 1,2 

x -—x 0 0 0 0,1,2 

Ptr tisy? 5 0 0 1,2 


Congruent polynomials 


Polynomials P, and P, are said to be congruent modulo n if and only if they 
take the same value at each integer. That is, PL (x) = P(x) (mod n) for each 
integer x. 


76 


RAME 6A 


FRAME 5 


FRAME 6 


FRAME 7 


M335 4.4 
Examples 
x? +x? 42x +2 = 4x? + 2 (mod 3) 


10x? — 2x? + 7 = 3x? +2(mod5) (Here corresponding coefficients are 
congruent modulo 5.) 


SAQ 14 
Use Fermat’s Theorem to show that 


x9 + 233 4 x* — 230 — x? — 1 = x* 4+ 3x3 + 2x + 4 (mod 5). 


SAQ 15 

Solve the polynomial congruences 
(i) x? — x = 0 (mod 6), 

(ii) x? — 1 20 (mod8), 

(iii) x? — x? = 0 (mod 8), 

(iv) x? — 120 (mod 7), 

(v) x? +x+1=0 (mod?) 


Solution to SAQ 15 


(i) By the Corollary to Fermat's Theorem, x? = x (mod 3) for all x. But 
x? = x (mod 2), and so x? = x (mod 2:3). That is, x? — x = 0 (mod 6) has 
solutions x = 0,1,2,3,4 and 5 (mod 6). 

(ii) x? = 1 (mod 8) if and only if x is odd. (See B page 22, paragraph 3.) 
Therefore x = 1,3,5 and 7 (mod 8) are the solutions. 

(ii) x? — x = x3(x? — 1) = 0 (mod 8) 
x =1,3,5,7 (mod8) = > x? —1=0 (mod8) 
x =0,2,4,6 (mod8) = > x? =0 (mod8) 
Therefore x? — x? = 0 (mod 8) for all integers x. 


(iv) x =1,2,4 (mod 7) all have x? = 1 (mod 7), and these are the three 
solutions. 


(v) Since xê — 1 = (x — 1) (x? + x + 1), any solution of 
x? + x + 1 =0 (mod 7) must also be a solution of x? — 1 = 0 (mod 7), of 
which there are three, found in part (iv). Trying x = 1,2,4 (mod 7) we find 
that x = 2,4 (mod 7) are the solutions of x? + x 4-120 (mod 7. M 


Factorizing polynomial congruences 
(i) Composite modulus 


x? —x=0(mod6) <> x=0,1,2,3,4,5 (mod 6). 
But 


x? — x = x(x? — 1) = x(x — 1) (x + 1) = 0 (mod 6) 


| 
x = 0 (mod 6) x = 1 (mod6) x = 5 (mod6) 
(ii) Prime modulus 
(a) x?+x+1=0(mod7) <= x=2,4 (mod7) 
(x — 2)(x — 4) = x? — 6x + 8 = x? + x + 1 (mod7) 
(b) x? — 1 =0 (mod7) <> x=1,2,4 (mod7) 
x? — 1 = (x — 1) (x? + x + 1) = (x — 1) (x — 2) (x — 4) (mod 7) 
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FRAMES Roots of polynomials 


If x, is a root of polynomial P(x) Example 
then 1, —2 and 3 are roots of 

P(x) = (x — x,)P(x). P(x) = 3x* — 9x3 — 9x? + 33x — 18. 
If x, is a second root, x, # Xs Therefore P(x) = (x — 1)P, (x), where 
then 


P,(x) = 3x3 — 6x? — 15x + 18. 
Now P, (x) = (x + 2)P, (x), where 
P,(x) = 3x? — 12x +9 
= (x — 3)P;(x), where 
P (x)= 3x — 3. 
Therefore 


P(x) = (x — 1)(x + 2)(x — 3) (3x — 3). 


P(X2) = (x, — x1) Py (x2) = 0 
so that P,(x,) = 0. Therefore 
P, (x)= (x — x3)P4(x) 
giving 
P(x) = (x — xj)(x — x3)Ps(x). 
Continuing, 


P(x) = (x = x4) (x = x3)...(x — x,) P,(x) 


shows that if P has more than n 
roots then the degree of P must 
exceed n. 


FRAME 9 Division Algorithm for polynomials 


Given polynomials a(x) and b(x) there exist polynomials q(x) and r(x) such 
that 


a(x) = q(x)  b(x) + r(x) 


where r(x) = 0 or the degree of r is less than the degree of b. 


Example 
x? goes into 2x? 
2x times 


2x +7 < 
x? -2x 4 1) 2x8 4 3x2 — 5x +4 


2x3 — 4x? + 2x 


x? goes into 7x? 


Te =. "sd. 
di pipa 
7x —3 


... and subtract 


Multiply 
x? — 2x +1 by 7... 
... and subtract 


2x? + 3x? — 5x + 4 = (2x + 7): (x? — 2x 1) - 7x - 3 


BUT 5:3 
3x t+ 
3x+1) x74+ x41 
x? +4 Dividing x? + x + 1 by 
» 3x + 1, neither quotient 
ax +1 nor remainder are 
2x + integral polynomials. 


v|- [oly 
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AME 10 Theorem 


FRAME 11 


If a(x) and b(x) are integer polynomials with the leading coefficient of b(x) 
equal to 1, then there exist integer polynomials q(x) and r(x) such that 


a(x) = q(x): b(x) + r(x) 
where r(x) = 0 or degree r(x) < degree b(x). 
Proof 
Consider the set S of integer polynomials 
S = {a(x) — e(x): b(x):c(x) any integer polynomial}. 


If the zero polynomial belongs to S, then a(x) = c(x)- b(x) + 0, for some 
integer polynomial c(x), and this is of the required form. Otherwise, let r(x) be 
a polynomial in S having least degree, say 


r(x) = a(x) — q(x):b(x) 


a(x) = q(x): b(x) + r(x). 
We shall show that degree r(x) < degree b(x) by contradiction. If 


degree r(x) = degree b(x) then 
b(x) = x" b, yx" +--+ bix + b, 
and 
r(x) = r,x"** + rx "*571. + -where s 2 0. 

Consider a(x) — (q(x) + r,x^)b(x), which belongs to S. 
a(x) — (q(x) + rix*)b(x) 

= r(x) — r, xb(x) 

—oguxm*5 e a pe — (rux" *5 rub, qx" 7571 ee) 

—(rQ—rb, oxi tee. 


This member of S has degree less than m + s, which contradicts the fact that 
r(x) has least degree. Bi 


READ B from Theorem 8-5 on page 162 to page 164, line — 11. 


Note on Lagrange’s Theorem 


From examples seen, we know that the condition p be prime is essential. Indeed, 
working to any composite modulus n we can always find a polynomial of degree 
1 for which the corresponding congruence has at least two incongruent 
solutions. For, if n = ab where 1 < a € b <n then the congruence 

ax + c= 0 (mod n) has gcd (a, n) = a incongruent solutions, whenever alc. 

This observation pinpoints the first place where the primality of p is needed in 
the proof of Lagrange’s Theorem. The case n = 1, launching the induction, is 
false for composite p. The primality is needed once more, on page 163, lines 7 
and 8. Arriving at (b — a)q(b) = 0 (mod p) and pJb — a, Theorem 3-1 (B page 
47) gives p|q(b) so that we deduce q(b) = 0 (mod p), as required. 


END OF TAPE 
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SAQ 16 Find the quotient and remainder when a(x) is divided by b(x), where 
(i) a(x)222 —x41, b(x)=x+1; 
(i) a(x)22x*—1, b(x)=x — x-1. 


SAQ 17 Solve the following congruences. 
G) x? — 2x + 120 (mod11) 
(ii) x!9 — x8 + 5x3 — 5x = 0 (mod 7) 
SAQ 18 Prove that the congruence 
XPT? px? poop x? + x + 1 = 0 (mod p), 


where p is an odd prime, has exactly p — 2 incongruent solutions, which are 
x = 2,3,...,p — 1 (modp). 


SOLUTION (i) BP al 
TO SAO 1 x +1)2x3 — x+1 
a O O 
— 2x?- x41 
= 2x^ Dre u 
x+1 
SI. 
0 
q(x) = 2x? - 2x +1 
r(x) 20 
(ii) 2x? + 2x +4 


grey Sade —1 


2x* = 2x? — 2x? 


2x3 + 2x? —1 


Dont Ie 
4x? 42x — 1 
4x? — 4x — 4 


q(x) = 2x? + 2x +4 
r(x)=6x+3 m 
SOLUTION (i) Noticing that x = 1 (mod 11) is one solution, we can factorize 
TOSAQ I E he Te f) 
= (x — 1)(x? + x — 12) 
= (x — 1) (x — 3) (x + 4) = 0 (mod 11). 


Therefore x = 1 (mod 11), x =3 (mod 11) and x = —4 = 7 (mod 11) are 
solutions, and by Lagrange’s Theorem, these are the only solutions. 


(ii) We first simplify the congruence, using the fact that x" = x (mod 7), for all 
x. This gives 


xt + 5x3 — x? — 5x = x(x? + 5x? — x — 5) = 0 (mod 7). 
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Spotting that x = 1 is a root of the second factor, 
x(x? + 5x? — x — 5) = x(x — 1) (x? + 6x + 5) 
= cbe— 1) (t= 2) 
= x(x — 1)(x + 1)(x — 2) = 0 (mod 7). 


As the original congruence is equivalent to one of degree 4, Lagrange’s 
Theorem tells us that the resulting four solutions, x = 0,1,2 and 6 (mod 7), 
are the only solutions. Bl 


By Fermat’s Theorem, each of the integers 1,2,...,p — 1 is a solution of the 
congruence x?~' — 1 = 0 (mod p). Now 
xet f= (x — 1) (xP? + xP Fee $7 4x4 1). 


Since none of the integers 2,3,...,p — 1 satisfy x — 1 = 0 (mod p) it follows, by 
the proof of Lagrange’s Theorem, that each of them is a solution of 
xP? + xP? 4 ...4x7+x+1=0(modp). M 


Fermat’s and Wilson’s Theorems Revisited 


The results of Fermat’s and Wilson’s Theorems are illustrated in the solutions of 
two well-known combinatorial problems. 


Having enough beads to permit unlimited use of each of n colours, how many 
different necklaces consisting of p beads can be made, where p is prime? 


We shall assume that the necklaces are produced by first forming a string of p 
beads and then joining the ends. Our first step is therefore to count how many 
different strings of p beads there are. Well that’s fairly straightforward. There are 
n choices for the first bead, n choices for the second, and so on. In all there are 
n? different strings. 


Notice that, of these strings, n consist of beads of one colour alone (one such for 
each colour). Hence there are n? — n strings involving at least two colours. As 
the single colour necklaces are easily counted, we shall put these on one side 
and concentrate on the ones involving two or more colours. 


As one must expect, strings counted differently in this way can result in the 
same necklace being produced. For example, the five strings of beads involving 
colours A, B and C illustrated below, all result in the same necklace. 


( (9 G ® © 
() (9 © (9 
(9 (O (9 (9 
(OO & ® (9 
0000A 


OmOmo 


Notice that the strings in this example are obtained from one another by 
removing the top bead and replacing it at the bottom of the string. 


We shall call the operation of moving a bead from the top to the bottom of a 
string a ‘move’. Two strings which can be obtained from each other by a 
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sequence of moves clearly correspond to the same necklace. From a given string 
of p beads, how many different strings can be obtained by moves? The obvious 
answer is p, unless after some fewer number of moves the original colour scheme 
is restored. We use the Division Algorithm to show that this cannot happen. 


Let k be the least number of moves which restore the colour scheme of a string 
of p beads (k > 1 since we are ignoring strings of a single colour). By the 
Division Algorithm, there exist integers q and r such that 


p=qk+r, whereO<r<k 


If k moves restore the colour scheme, then so too do 2k, 3k,...,qk moves. 
Furthermore, p moves certainly restores the colour scheme, since each bead 
returns to its initial position. Therefore, after the qkth move only a further r 
moves are needed to restore the colour scheme, and since r < k, we are forced 
to conclude that r = 0. This in turn implies that k divides p, and since p is prime 
and k > 1 we must have k = p. 


The outcome of this is that the n? — n strings involving two or more colours are 
partitioned into disjoint sets of p strings, each set consisting of all strings which 
can be obtained from each other by a sequence of moves. This means that 

p|n? — n, which is Fermat’s Little Theorem. 


n -—n 


p 
different necklaces involving two or more colours and hence a total of 


nm —n 


One might be tempted to conclude from this that there are in all 


+ n different necklaces, Unfortunately there is a further complication 


which we have not taken into account. Consider the string shown. It is the first string 
of our earlier example but with order reversed (ie. turned over). It gives rise to 
a further set of five strings which correspond to the same necklace (turned over) 
in the example. 


© 
(4) 
(8) 
O 
Q 


So, in general, two different sets of p strings might give rise to the same 

necklace. We say ‘might’ because turning a string over does not always produce 

a new string—or, put another way, some necklaces are symmetrical! 

We leave you the task of completing Problem 1 as a challenge problem. For the 
n'—pm nPI yn 2 he , 

record, the answer is 2 e 2 + n. Notice that as this is an integer 

p 
value, the first term still incorporates Fermat’s Little Theorem. 


A similar problem leads to an illustration of Wilson's Theorem. 
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ROBLEM 2 How many stellated p-gons are there? 


A stellated n-gon is formed by placing n points at equal distances around the 
circumference of a circle and then joining these points with n straight-line 
segments, crossings being allowed. For n = 5, there are twelve stellated 


pentagons, as shown in the figure. 
Notice that the five pentagons in the first row are congruent, being obtained 
. 7) ul a ; 
from each other by rotations through T Similarly, the pentagons in the second 
row are congruent. The remaining two are the regular stellated pentagons, 


, ! 2n ; , k 
having rotation through um as a symmetry operation. Hence, if we classify 


stellated pentagons up to similarity (for we are not concerned about size) there 
are just four essentially different ones. 


In the same way, we now want to count the number of different stellated p-gons, 
where p is a prime (greater than 2). We leave you to confirm the details, which 
are as follows. 


Placing p points at equal distances around the circumference of a circle, the 
total number of stellated p-gons that can be formed by joining them 
(corresponding to the twelve for p = 5) is (p — 1)!. The number of regular ones 
is ¿(p — 1). [The regular stellated p-gons are formed by choosing an integer n in 


-1 "T : 
the range 1 < n < p and joining each point to the one n further round the 


circle, clockwise. ] The remaining ¿(p — 1)! — ¿(p — 1) irregular stellated p-gons 
are partitioned into congruent sets of p each, the members of each set being 


" 2 . 
obtained from each other by rotations through = Hence there are in all 
p 


i-0)!'-30-1,, (p—-1)!—(p—1) , p-1 
+ 3(p — 1) | 
p 2p 2 
different stellated p-gons. As this is an integer, the first term illustrates that 
pl(p — 1)! — (p — 1). That is, (p — 1)! = (p — 1)  —1 (mod p), which is Wilson's 
Theorem. 
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Summary 


After a brief look at Fermat’s Factorization Method, the remainder of the unit 
hinged on Fermat's Little Theorem and its applications. The theorem says that 
for a prime p, if px then x?! = 1 (mod D). Catering for the case p|x we have 

the Corollary that x? = x (mod p) for all integers x. 


A variation in the proof of Fermat's Theorem led to Wilson's Theorem, which 
says that (p — 1)! = —1 (mod p) for any prime p. 


Looking at applications of these two theorems we have begun a study of 
polynomial congruences and their solutions. To this end we have uncovered two 
important results. First we have seen that the congruence x? + 1 = 0 (mod p) 
has no solutions when p = 3 (mod 4) and has two solutions when p = 1 (mod 4). 
Second, Lagrange's Theorem tells us that in any polynomial congruence to 
prime modulus, the number of solutions cannot exceed the degree of the 
polynomial. 


The main definitions and results encountered in this unit are listed in the 
corresponding entry in the Handbook. 
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